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ABSTRACT 

The Las Campanas IR (LCIR) Survey, using the Cambridge Infra-Red Survey Instru- 
ment]^] (CIRSI), reaches H ~ 2 11 over nearly 1 degree 2 . In this paper we present results 
from 744 arcmin 2 centred on the Hubble Deep Field South for which UBVRI opti- 
cal data are publicly available. Making conservative magnitude cuts to ensure spatial 
uniformity, we detect 3177 galaxies to H = 20.0 in 744 arcmin 2 and a further 842 to 
H = 20.5 in a deeper subregion of 407 arcmin 2 . We compare the observed optical- 
IR colour distributions with the predictions of semi-analytic hierarchical models and 
find reasonable agreement. We also determine photometric redshifts, finding a median 
redshift of ~ 0.55. We compare the redshift distributions N(z) of E, Sbc, Scd and Im 
spectral types with models, showing that the observations are inconsistent with sim- 
ple passive-evolution models while semi-analytic models provide a reasonable fit to the 
total N{z) but underestimate the number of z ~ 1 red spectral types relative to bluer 
spectral types. We also present N(z) for samples of extremely red objects (EROs) 
defined by optical-IR colours. We find that EROs with R — H > 4 and H < 20.5 have 
a median redshift z m ~ 1 while redder colour cuts have slightly higher z m . In the 
^ • magnitude range 19 < H < 20 we find that EROs with R — H > 4 comprise ~18 per 

cent of the observed galaxy population, while in semi-analytic models they contribute 
only ~4 per cent. 

We also determine the angular correlation function w{9) for magnitude, colour, 
spectral type and photometric redshift-selected subsamples of the data and use the 
photometric redshift distributions to derive the spatial clustering statistic as 
a function of spectral type and redshift out to z ~ 1.2. Parametrizing £(r) by 
£(r c ,z) = {r c /r* (z))" 1,8 where r c is in comoving coordinates, we find that r*(z) in- 
creases by a factor of 1.5-2 from z = to z ~ 1.2. We interpret this as a selection 
effect - the galaxies selected at z ~ 1.2 are intrinsically very luminous, about 1-1.5 
magnitudes brighter than L*. When galaxies are selected by absolute magnitude we 
find no evidence for evolution in r* over this redshift range. Extrapolated to z = 0, 
we find r*(z = 0) ~ 6.5 /i _1 Mpc for red galaxies and r*(z = 0) ~ 2-4 /i _1 Mpc for 
blue galaxies. We also find that while the angular clustering amplitude of EROs with 
R — H > 4 or / — H > 3 is up to four times that of the whole galaxy population, 
the spatial clustering length r*(z = 1) is ~7.5-10.5 h~ 1 Mpc which is only a factor of 
~ 1 .7 times r* (z = 1) for R — H < 4 and I — H < 3 galaxies lying in a similar redshift 
and luminosity range. This difference is similar to that observed between red and blue 
galaxies at low redshifts. 

Key words: galaxies: clustering - cosmology: observations - surveys - infrared: 
galaxies - galaxies: photometry - galaxies: distances and redshifts. 
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1 INTRODUCTION 

The recent availability of panoramic near-IR cameras - such 
as the Cambridge Infra-Red Survey Instrument - has opened 
up the window on the 1 < z < 2 Universe. While spec- 
troscopic surveys such as the 2dF Galaxy Redshift Survey 
(Colless et al. 2001), CFRS (Le Fevre et al. 1996), CNOC2 
(Carlberg et al. 2000) and the Caltech Faint Galaxy Redshift 
Survey (Hogg, Cohen & Blandford 2000) probe the z < 1 
Universe and the Lyman-dropout technique (Steidal et al. 
1996; Giavalisco et al. 1998) selects star-forming galaxies at 
z ~ 3, the redshift range 1 < z < 2 has traditionally been 
'hidden' to optical astronomers due to the lack of prominent 
spectral features at optical wavelengths in galaxies at these 
redshifts. In particular, massive evolved red galaxies become 
very faint at optical wavelengths above z ~ 1 as the 4000 A 
break moves out of the I band. In contrast red galaxies re- 
main prominent out to z ~ 2 in the near-IR. Furthermore, 
rather than being sensitive to recent bursts of star forma- 
tion, near-IR luminosity closely tracks total stellar mass over 
these redshifts and is less affected by dust obscuration, mak- 
ing comparisons with models more straightforward than in 
optically-selected surveys. 

Determining the number density of massive evolved 
galaxies at these redshifts is an important test for galaxy 
evolution models. In the traditional passive-evolution model 
(Eggen, Lynden-Bell & Sandage 1962; Sandage 1986), such 
galaxies form in single monolithic collapses at high redshift 
and then evolve passively with no new star formation, thus 
giving rise to a constant comoving density of massive galax- 
ies to high redshifts. On the other hand, in hierarchical 
merger models massive elliptical galaxies formed relatively 
recently from the merging of smaller disc galaxies (White 
& Rees 1978). Thus it has been proposed that at z ~ 1-2 
the two models should give very different predictions for the 
number density of massive evolved galaxies (Kauffmann & 
Chariot 1998). 

Likewise, measurements of the three-dimensional clus- 
tering of galaxies in the redshift range 1 < z < 2 fill an 
important gap in our knowledge of the evolution of large- 
scale structure between z < 1 and z ~ 3 that until recently 
has been filled only by deep pencil-beam surveys such as the 
Hubble Deep Fields (Arnouts et al. 1999; Magliocchetti & 
Maddox 1999) whose small fields of view and small sample 
size inhibit a proper clustering analysis. 

One result of early near-IR surveys (Elston, Rieke & 
Rieke 1988; McCarthy, Persson & West 1992; Hu & Ridg- 
way 1994; Cowie et al. 1994; Moustakas et al. 1997; Barger 
et al. 1999; Thompson et al. 1999) has been the discovery of 
populations of extremely red objects (EROs) defined in the 
literature by various extreme optical-infrared colours, typi- 
cally R—K > 5 or 6 which roughly corresponds to R—H > 4 
or 5 and I — H > 3 or 4. These colours are characteristic of 
elliptical galaxies at z ~ 1 or greater (Fig. ^) - so that the 
4000 A break falls between the optical and the near-IR fil- 
ters, high-redshift highly- reddened dusty star-forming galax- 
ies, obscured AGN and low-mass stars. Deep spectroscopy 
of a few such objects (Dunlop et al. 1996; Graham & Dey 
1996; Cohen et al. 1999; Cimatti et al. 1999; Liu et al. 2000) 
and morphological studies using the Hubble Space Telescope 
(Moriondo, Cimatti & Daddi 2000; Stiavelli & Treu 2000) 
suggest that 70-80 per cent are evolved massive ellipticals at 
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Figure 1. R — H colour as a function of redshift for empirical 
SEDs (Coleman, Wu & Weedman 1980; CWW) E, Sbc, Scd, Im 
and for evolving Bruzual & Chariot (1993; BC) GISSEL' 98 SEDs 
with single stellar population burst, exponentially decaying with 
time-scale r = 1, 3, 5 and 15 Gyr and constant star formation 
rate, all with formation redshift Zf = 10, using an Qo = 0.3, 
= 0.7 and Ho = 70 km s — 1 Mpc — 1 cosmology (see §^j.2 for 
details). The lower panels show BC SEDs with varying amounts of 
reddening (Calzetti et al. 2000). The CWW SEDs have not been 
reddened since they are empirical and intrinsically include some 
reddening. Only the most evolved z > 0.7 or very dust-reddened 
galaxies are picked out by an R — H > 4 colour cut. 



z ~ 1—1.5. Submillimetre observations (Cimatti et al. 1998; 
Smail et al. 1999; Dey et al. 1999), spectroscopy and mor- 
phology show that others, especially the reddest, are z > 1 
dusty starbursts. Initial estimates of the number densities 
of EROs varied considerably (Barger et al. 1999; Thompson 
et al. 1999) showing that wider field surveys would be nec- 
essary to obtain meaningful statistics. This was later con- 
firmed by measurements of very strong angular clustering 
(Daddi et al. 2000a; McCarthy et al. 2000). The strong clus- 
tering supports the interpretation that a majority of these 
EROs are massive galaxies in a narrow redshift range con- 
strained at z ~ 1 by the red colour cut and at z ~ 1.5 by 
the limiting magnitudes of current surveys (and/or spectral 
evolution). However the true significance of the measured 
clustering amplitude was unclear, given that any popula- 
tion of objects at the bright end of the luminosity function 
and in a restricted redshift range may be expected to appear 
strongly clustered. A more informative picture comes from 
the three-dimensional clustering scale which may be inferred 
from angular clustering measurements provided the redshift 
distribution N(z) can be estimated. 

Recognizing the above we began a deep wide-field sur- 
vey (1 degree 2 ) in several optical and near-IR filters to 
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Figure 2. I — H colour as a function of redshift for empirical 
SEDs (Coleman, Wu & Weedman 1980; CWW) E, Sbc, Scd, Im 
and for evolving Bruzual & Chariot (1993; BC) GISSEL' 98 SEDs 
with single stellar population burst, exponentially decaying with 
time-scale r = 1, 3, 5 and 15 Gyr and constant star formation 
rate, all with formation redshift Zf = 10, using an Co = 0.3, 
SIa =0.7 and Ho = 70 km s _1 Mpc -1 cosmology (see §^.2 for 
details). The lower panels show BC SEDs with varying amounts of 
reddening (Calzetti et al. 2000). The CWW SEDs have not been 
reddened since they are empirical and intrinsically include some 
reddening. Only the most evolved z > 0.9 or very dust-reddened 
galaxies are picked out by an I — H > 3 colour cut. 

study evolved galaxies at z > 1 and obtain statistically 
significant samples of EROs. By applying a photometric 
redshifts template-fitting technique (Bolzonella, Miralles & 
Pello 2000) we assign an approximate redshift and best- 
fitting spectral type to each galaxy in the sample, thus 
enabling us to calculate redshift distributions for different 
spectral types and make more detailed comparisons with 
models than are possible with simple colour-selected sam- 
ples. Photometric redshifts may also be used to obtain the 
projected clustering amplitude in redshift shells and, while 
they are not accurate enough to directly measure three- 
dimensional clustering, the estimated redshift distributions 
may be used along with Limber's equation to infer the three- 
dimensional clustering from the measured projected clus- 
tering amplitudes. By dividing the projection axis between 
several redshift shells, the signal-to-noise in clustering mea- 
surements is increased relative to simple imaging surveys 
(see also Brunner, Szalay & Connolly 2000; Teplitz et al. 
2001; Brown, Boyle & Webster 2001). Furthermore, with 
photometric redshifts one can avoid mixing different spec- 
tral types, and intrinsic luminosities while still obtaining 
much larger sample sizes than are currently possible with 
spectroscopic surveys at these redshifts and magnitudes. 



This paper presents results for one of our fields cover- 
ing 744 arcmin 2 in which we have data in the UBVRIH 
filters. The field is slightly larger than that used by Daddi 
et al. (2000a) and has the advantage of enough optical fil- 
ters to determine photometric redshifts for all objects. A 
companion paper (McCarthy et al. 2001) presents red ob- 
ject number counts and clustering results to H ~ 21 for a 
further three fields in which we currently have deep H imag- 
ing but less extensive optical coverage while a third paper 
(Chen et al. 2001) presents a broad overview of the survey 
and initial results. The structure of this paper is as follows. 
In §^ we briefly describe data acquistion and reduction pro- 
cedures. In §0 we describe catalogue generation and charac- 
teristics. In §0 we review the photometric redshift technique 
and present simulations relevant to the particular survey fil- 
ters and S/N characteristics and in §^ we discuss star-galaxy 
separation methods. In §[] we introduce several galaxy evolu- 
tion models: a semi-analytic hierarchical merger model and 
simple no-evolution and passive-evolution models, and in §^ 
we compare number counts, colour-colour, colour-magnitude 
and N(z) distributions between the observations and the 
models. In ^ we present our angular clustering results and 
in §^ we present our spatial clustering results. Finally § |Io| 
and §[ll] contain a discussion and summary. We use the Vega 
magnitude system throughout the paper, spatial clustering 
lengths are expressed using comoving coordinates and we 
write the Hubble constant as Ho = 100/t km s -1 Mpc -1 . 



2 THE OBSERVATIONS 

The Las Campanas IR (LCIR) Survey covers 24 tiles ar- 
ranged in 6 disparate groups at high galactic latitude. Each 
square tile is ~13 arcmin on a side, corresponding to ~ 8.7 
h~ Mpc in comoving coordinates (~ 4.4 h~ 1 Mpc in phys- 
ical coordinates) at z = 1 for an Qo = 0.3, £7a = 0.7 cos- 
mology. In this paper we present results for 6 tiles centred 
on the Hubble Deep Field South (HDFS) at R.A. 02:33:13, 
Dec. —60:39:27. In these tiles we have approximately 80 
minutes per pixel of H imaging (Fig. ^|) obtained with the 
Cambridge Infra-Red Survey Instrument^ (CIRSI) on the 
2.5-m Du Pont Telescope at Las Campanas Observatory 
in September-October 1999 and 2000, and optical UBVRI 
imaging obtained by the Goddard Space Flight Center with 
the Big Throughput Camera (BTC) at Cerro Tololo Inter- 
American Observatory in September 1998 and made pub- 
licly available (Palunas et al.| 2000; see Teplitz et al. 2001 
for a photometric redshift and clustering analysis of the full 
0.5 degrees 2 ). The CIRSI camera (Beckett et al. 1998) has 
4 HgCdTe 1024 x 1024 detector-chips spaced at ~90 per 
cent of a chip width so that stepping the camera four times 
fills in the gaps between the chips. At the Du Pont the 4k 
x 4k mosaic covers 13 x 13 arcmin 2 with a pixel scale of 
~0.2 arcsec per pixel. The BTC has a pixel scale of 0.43 arc- 
sec per pixel. The seeing FWHM in the reduced H images 
is typically 0.9-1.1 arcsec. In the optical images the seeing 
FWHM is typically 1.5-1.6 arcsec. 

* An instrument developed with the support of the Raymond 

and Beverly Sackler Foundation. 

t http: / /hires. gsfc.nasa.gov/ ~research/hdfs-btc/ 
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In the near-IR, high background limits the length of ex- 
posures. We used an exposure time of 45 s, dithering by ~10 
arcsec after each 3-4 consecutive exposures. Between 5 and 
9 dithers were completed at one telescope pointing before 
moving to the next mosaic position. Fields were observed 
over the course of several nights to make up the full 80 min- 
utes per pixel exposure time. Standards stars from Persson 
et al. (1998) were observed on chip 1 periodically throughout 
the night in photometric conditions using a 5 point dither 
pattern. At least once per observing run standards were ob- 
served on all four chips. Data taken during non-photometric 
conditions were either discarded or calibrated to data taken 
during photometric conditions. 

Reduction of the infrared data involved the following 
steps: (1) flat-fielding using the difference of illuminated and 
dark dome flats, (2) subtraction of a sky frame (the running 
median of typically 3 adjacent frames on either side), (3) 
subtraction of the column and row modes (in order to re- 
move electronic ramp effects) , (4) coaddition of the resulting 
frames, (5) detection and masking of objects, (6) repetition 
of steps (2)-(3) with objects masked and (7) final coaddition 
and mosaicing of the resulting frames. Further details of the 
observing strategy and data reduction are given in Chen et 
al. (2001) and Sabbey et al. (2001). 



3 OBJECT CATALOGUES 

We begin by defining our system of measuring object magni- 
tudes. In general one may define an object's magnitude to be 
the sum of the flux within a given aperture, typically defined 
to be some isophote (isophotal magnitudes) or a larger aper- 
ture whose dimensions are designed to enclose essentially all 
of the object's flux (e.g. Kron and Petrosian magnitudes). 
While in principle these magnitudes measure the total flux 
from an object, they also suffer from decreased signal-to- 
noise in the faint wings of extended objects and, further, 
at faint magnitudes the aperture can be poorly defined. An 
alternative approach is to measure the flux for every object 
within some circular aperture of fixed diameter. The disad- 
vantage of this method is that for very extended or nearby 
galaxies a significant fraction of the galaxy's flux falls out- 
side the aperture. However in this paper we are interested 
mainly in determining photometric redshifts, for which the 
important quantity is an as accurate as possible determi- 
nation of each object's colours. In particular, magnitude 
measures which use a different aperture in different filters 
are generally inappropriate. For the observations presented 
here we find that a 3 arcsec diameter aperture provides a 
good compromise between missing flux from smaller aper- 
tures and decreased signal-to-noise in larger apertures while 
being sufficiently large compared with the range of seeing 
FWHM values (0.9-1.6 arcsec over the different filters) to 
be robust with respect to seeing variations across individual 
tiles. 

The fraction of flux scattered outside the aperture de- 
pends on the seeing. In order to compute object colours 
it is necessary to correct for the different seeing FWHM 
in the different filters. We determine this correction from 
bright stars selected in each filter and tile. The object detec- 
tion software package SExtractor version 2.1.6 (Bertin 



& Arnouts | 1996) was used to detect objects on all im- 
ages and between 30 and 60 bright non-saturated stars 
were selected in each filter in each of the 6 tiles using 
SExtractor's neural network star/galaxy separation param- 
eter CLASS_STAR. From these, the median seeing FWHM and 
magnitude offset between 3 arcsec and 10 arcsec diameter 
apertures was determined for each tile in each filter. Given 
the seeing FWHM of these images, a 10 arcsec aperture mag- 
nitude is within 0.02 magnitudes of a total magnitude for a 
point source. For LCIR survey objects, the magnitudes that 
we quote throughout this paper are 3 arcsec aperture mag- 
nitudes corrected to 10 arcsec with the seeing correction for 
the relevant filter. For compact sources these are essentially 
total magnitudes. For extended objects these underestimate 
total magnitudes. 

The degree to which total magnitudes are under- 
estimated was investigated using the irafj^] package 
noao . artdata. This package allows one to place simulated 
galaxies of various light profiles and magnitudes on to an 
image. One can then measure magnitudes using the same 
methods that are used for the data and compare the mea- 
sured magnitudes with the input magnitudes to quantify any 
offset. The estimated flux missed from a 3 arcsec aperture 
is listed, for various object profiles, in Table |l|. As noted 
in the table caption, these values are insensitive to seeing 
provided the relevant seeing corrections are first applied to 
the 3 arcsec aperture magnitudes. At H — 20.0 we expect 
typical half-light radii of ~0.5 arcsec (Smail et al. 1995; Yan 
et al. 1998; Corbin et al. 2000) giving magnitude offsets of 
order 0.1-0.3 at the faint end, depending on the mix of pro- 
files. We emphasis that we do not correct the observations 
for this 'aperture effect' as the half-light radius of a given 
galaxy is poorly determined at the faint end and we are in 
any case mainly interested in object colours. 

A first catalogue was made using SExtractor to detect 
objects on the H image with the threshold for object de- 
tection set at 13 contiguous pixels 1.3er above the local sky 
r.m.s. For comparison, an object with a gaussian profile of 
FWHM equal to the seeing, which registers a la detection in 
a 3 arcsec diameter aperture, will have ~26 (depending on 
the particular tile) pixels 1.3a above the sky r.m.s. Broader 
profiles at the same magnitudes register fewer pixels above 
1.3a, so we also investigated the detection efficiency as a 
function of object profile (see below). The coordinate trans- 
formations between the H images and the optical images 
were calculated using the iraf task geomap. Between 50 and 
200 stars were used in each tile to derive the transformation 
and the resulting r.m.s. was less than 0.2 arcsec. The iraf 
task noao . digiphot . apphot . phot was used to measure 3 
arcsec aperture magnitudes at the coordinates of the object 
centres in each filter. An i/-band detection limit was esti- 
mated in each tile by measuring the sky r.m.s. in a grid 
of apertures over the image (with cr-clipping to remove ob- 
jects). From these values the background noise in 3 arcsec 
apertures and the required fluxes to register la detections 
were estimated at each point in the grid. In each tile the lim- 
iting flux/magnitude was derived from the 95th percentile 



t http:/ /terapix. iap.fr/sextractor/ 
§ http://iraf.noao.edu/ 
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Figure 3. The H-band image with the positions of the masked regions over bright stars marked. Each of the 6 tiles measures roughly 13 X 
13 arcmin 2 . The width of the full image corresponds to ~ 27 h~ 1 Mpc in comoving coordinates (~ 13.5 h~ 1 Mpc in physical coordinates) 
at z = 1 for an Qrj = 0.3, C2a = 0.7 cosmology. Assuming this cosmology and calculating volumes to the median (photometric or 
spectroscopic) redshift of a survey, the volume probed by the HDFS field of the LCIR survey is approximately (40 h _1 Mpc) 3 (z m ~ 0.55). 
This compares with ~ (14 h^Mpc) 3 for the Hubble Deep Field North (Williams et al. 1996) (assuming z m = 1.2) or ~ (200 h^Mpc) 3 
for the 2dF Galaxy Redshift Survey with z m ~ 0.11 (Colless et al. 2001). (Note on astro-ph version: some background structure is due 
to image compression.) 



of these fluxes - i.e. a source at this limiting (aperture) mag- 
nitude is sufficiently bright to register a la or greater detec- 
tion in a 3 arcsec aperture over 95 per cent of the tile's area 
(Fig. ^) . These limits were applied to the catalogue giving 3 
tiles (337 arcmin 2 ) at H < 20.0 and 3 tiles (407 arcmin 2 ) at 
H < 20.5 (where these limiting magnitudes include the see- 
ing correction). While some regions of the survey go deeper 
(see Chen et al. 2001) we make these cuts to ensure a spa- 
tially uniform magnitude limit - making a clustering analy- 
sis more straightforward. 

Since more extended galaxies, with the same total 



magnitude, yield fewer pixels above the 1.3cr threshold 
than compact galaxies, there is an almost inevitable bias 
against extended or low surface brightness galaxies, espe- 
cially at the magnitude limit. We measured the efficiency 
with which SExtractor detects objects using the iraf pack- 
age noao.artdata (see also Gray et al. 2000; Chen et al. 
2001). For a range of magnitudes surrounding the nominal 
limiting magnitude in each iZ-band tile, and for a variety of 
light profiles, 1000 simulated objects were placed onto the 
tile. The fractions recovered, using SExtractor with the de- 
tection parameters described above, are plotted in Fig. At 
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Table 1. Table of the magnitude offsets between the 3 arcsec aperture magnitudes used in this paper and actual total magnitudes for 
various light profiles, as measured with the iraf task noao . artdata. is the projected radius containing half the total flux. Object light 
profiles are convolved with a seeing disc of FWHM 1 arcsec before being placed on to a data image. As with the actual observations, 
object magnitudes are measured in a 3 arcsec diameter aperture then corrected for seeing using the offset from 3 to 10 arcsec aperture 
magnitudes as determined from bright stars in the same image. The resulting 'seeing-corrected' aperture magnitude is compared with the 
input total magnitude. For stars the offset is 0. For extended objects extra flux is missed from a 3 arcsec aperture besides that scattered 
by the seeing disc, leading to non-zero offsets. The same procedure repeated for different seeing FWHM values in the range covered by 
the data (viz. 0.9-1.6 arcsec) produces very similar offsets. 

axis ratio = 0.3 axis ratio = 0.7 axis ratio = 1.0 

7"h (arcsec) stellar exponential disc de Vaucouleurs exponential disc de Vaucouleurs exponential disc de Vaucouleurs 







0.25 0.01 0.14 0.02 0.15 0.02 0.16 

0.50 0.06 0.20 0.09 0.26 0.12 0.30 

0.75 0.15 0.29 0.21 0.38 0.28 0.44 

1.00 0.25 0.38 0.35 0.50 0.46 0.58 

1.50 0.47 0.56 0.66 0.71 0.82 0.80 

2.00 0.68 0.69 0.95 0.87 1.15 0.98 




'17.':. 



Figure 4. We define the limiting H magnitude at a given point 
to be the seeing-corrected H magnitude that corresponds to a 
7a detection in a 3 arcsec aperture centred at that point. We 
determine these limiting H magnitudes on a grid of points across 
each tile and plot the corresponding cumulative fractional area 
versus limiting H magnitude. The six lines correspond to the six 
tiles. The nominal H magnitude limit in each tile is chosen so that 
95 per cent of the tile's area has a limiting magnitude fainter than 
the tile limiting magnitude (dashed lines indicate these limits for 
two of the six tiles). 

the nominal limiting magnitudes, the detection efficiency for 
stellar and compact sources is 90-100 per cent. This drops 
to 70-90 per cent for objects with half-flux radii ~ 0.5 
arcsec. Thus there may be some bias against such objects 
at the faintest magnitudes but not enough to greatly affect 
our results. 

After applying the H magnitude limits to the initial 
catalogue, saturated or nearly saturated objects were also 
deleted from the catalogue and regions around highly satu- 
rated objects were excised, for while bright objects are not 
very prominent in the H image and are unlikely to have 
much effect on the estimated clustering in an ff-selected 
sample, they could affect the photometry of nearby objects 
in the R and I images and hence affect photometric red- 




input H (total) magnitude 

Figure 5. Detection efficiency as a function of input (total) 
magnitude for various light profiles as determined from simula- 
tions using the iraf task noao . artdata. For each magnitude and 
light profile, 1000 simulated objects are placed onto one of the 
LCIR survey ff-band tiles after convolving with the appropriate 
seeing disk (0.9—1.1 arcsec depending on the tile). The fraction 
recovered with SExtractor, using the detection parameters de- 
scribed in is plotted as a function of magnitude. The solid 
lines correspond to the three tiles with a nominal limiting mag- 
nitude of H = 20.0 and the dashed lines correspond to the three 
tiles with a nominal limiting magnitude of H = 20.5. At H = 20.0 
we expect typical half-flux radii of r/ t ~ 0.5 arcsec for which the 
detection efficiency is 70-90 per cent, reaching 90-100 per cent 
for more compact galaxy profiles. 

shifts. Furthermore, regions where object apertures overlap 
the image edges were masked. Every object was inspected 
visually in H and I in order to remove any spurious detec- 
tions. 

The final catalogue contains a total of 5401 objects to 
H = 20.0 and a further 1029 to H = 20.5. A significant 
fraction are however stars (see §^). The majority of objects 
have S/N > 5 in RIH, and > 2.5 in BV. While the extremely 
red objects have low S/N in the optical filters they have 
prominent 4000 A breaks which aid photometric redshifts. 
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4 PHOTOMETRIC REDSHIFTS 

In order to determine spectral types and approximate red- 
shifts for the objects in our survey we use the publicly avail- 
able photometric redshift code hyperzQ (Bolzonella et al. 
2000; BMP). The basic procedure is to compare the mea- 
sured colours of each object with a library of template spec- 
tral energy distributions (SEDs) on a grid of redshifts. The 
best-fitting template SED and redshift are determined by 
minimising 



E SED 



Sbc SED 



Im SED 



fi — ati 



(1) 



where /, is the observed flux in filter i, en is the error in fi, ti 
is the template flux in filter i and a is the scaling factor nor- 
malizing the template to the observed flux. We use the four 
template SEDs of Coleman, Wu & Weedman (1980; CWW) 
that are distributed with hyperz. These have been extended 
into the UV and near-IR with Bruzual & Chariot (1993; 
BC) spectral synthesis models and correspond roughly to 
the spectra of E, Sbc, Scd and Im galaxies. 

The survey filters are displayed in Fig. ^| along with the 
spectral energy distributions (SEDs) of the E (elliptical), 
Sbc (spiral) and Im (irregular) galaxy templates redshifted 
to z = 0, 1, 2 and 3. In our survey fields we have various com- 
binations of the filters UBVRIZJHK though in the field 
used in this paper we only have the filters UBVRIH . This 
set of filters allows reasonable photometric redshift determi- 
nation for all galaxies: spectral-type E galaxies have a promi- 
nent 4000 A break that lies between the survey filters over 
the redshift range of interest; conversely, while spectral- type 
Im galaxies lack prominent spectral features at appropriate 
wavelengths, in an _ff-selected sample they have relatively 
high S/N in the optical filters. 

In any application of photometric redshifts to a galaxy 
survey it is important to investigate the magnitude and char- 
acteristics of photometric redshift errors for the particular 
filter set and limiting magnitudes used in the survey. We 
considered each of the four template SEDs (E, Sbc, Scd, 
Im) and calculated UBVRI magnitudes for H = 20 galax- 
ies on a grid of redshifts from to 3.5. We then added 
random noise in accordance with the noise present in the 
data at the relevant magnitude in each filter. In addition, a 
minimum error term of 0.05 magnitudes r.m.s. was imposed 
since, given calibration errors, aperture correction errors (if 
the seeing FWHM varies over a tile) and other potential 
errors, there is necessarily some error in the photometric ze- 
ropoint of this order. One hundred randomized replicates of 
each SED/redshift combination were created and photomet- 
ric redshifts were estimated for the resulting catalogue. The 
medians of the ranges of offsets Az = 2 modc i — z photomctric are 
very small over the redshift range of interest (Fig. Q). The 
scatter in Az - measured by the half-range of the central 68 
per cent of offsets - is plotted in Fig. ^[ For z < 1.5, which 
is where we expect most of our galaxies to lie, the scatter in 
Az is less than 0.1. Also of interest is the number of galaxies 
of one spectral type that are misidentified as other spectral 
types. This also varies with redshift. These results for the 
above simulation are presented in Fig. 
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Figure 6. Plots of template E (elliptical), Sbc (spiral) and Im 
(irregular) SEDs placed at various redshifts, illustrating the 912 A 
break in blue galaxies and the 4000 A break in red galaxies. The 
survey filters are illustrated below (solid lines represent the filters 
available for this paper, dotted lines represent filters available in 
some of our other fields). 
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Figure 7. The median offsets between model redshifts and pho- 
tometric redshifts for different template SEDs with H = 20 and 
noise added to mimic the observations. Error bars enclose the 
central 68 per cent of offsets. 



These simulations represent an idealised situation - we 
are assuming that real galaxies correspond exactly to the 
template SEDs, and we are assuming that there are no sys- 
tematic or spurious errors in our data. The component of 
photometric redshift errors due to these latter effects may 
be gauged by comparing photometric redshifts with spec- 
troscopic redshifts. Using 139 spectroscopic redshifts (com- 
piled by Fernandez-Soto et al. 2001; see also Fernandez-Soto, 
Lanzetta & Yahil 1999) in the Hubble Deep Field North 
(HDFN) and the F300W, F450W, F606W and F850W 
Hubble Space Telescope imaging (Williams et al. 1996) and 
JHK ground-based imaging (Dickinson et al. in prepara- 
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Figure 8. The scatter in photometric redshifts relative to model 
redshifts - measured by the half-range of the central 68 per cent 
of offsets — for different template SEDs with H = 20 and noise 
added to mimic the observations. The inset shows the expected 
H magnitude of a non-evolving (k-correction only) 2L* E galaxy 
(assuming M* = —23.1 + 51ogh in the K band (Gardner et al. 
1997)) as a function of redshift with the two limiting H magni- 
tudes used in this paper (H < 20.0 and H < 20.5) marked. The 
relatively poor performance with respect to the E SED at z ~ 2 
is due to the 4000 A break moving completely out of the optical 
filters so that there is too little information to pin-point its posi- 
tion. The addition of Z and J filters would remove this problem. 
At 2 < 1.5 — where we expect most of the objects in this paper 
to lie — the scatter in Az is less than 0.1. 
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Figure 9. The number of galaxies misidentified as a different 
spectral type as a function of redshift for different template SEDs 
with H = 20 and noise added to mimic the observations. In the 
majority of cases the spectral type is identified correctly, and the 
few mistakes are generally within one spectral type class of the 
correct class. 



tior|]), the r.m.s. in 1+ ^ , where A z = z spcc - z phot , is 
~ 0.07 (see also BMP; Hogg et al. 1998; Connolly, Szalay 
& Brunner 1998; Arnouts et al. 1999). Since the S/N in 
the HDF data is very high, this value gives an indication of 
the amount of scatter that arises from intrinsic differences 
between the template SEDs and the SEDs of observed galax- 
ies. In the LCIR survey there are fewer filters and lower S/N 
data. Fig. [l^ compares photometric and spectroscopic red- 
shifts in the HDFN using just the optical F 300W, F450W, 
F606W and F850W and near-infrared H filters (i.e. one 
fewer filter than used in the LCIR survey HDFS field). The 
r.m.s. in A z , for z < 1.5, is ~0.14. Since our HDFS field over- 
laps redshift surveys in the Hubble Deep Field South (Cris- 
tiani et al. 2000 and references therein; Glazebrook et al. in 
preparation^) we may directly compare our photometric 
redshifts, using the LCIR survey photometry, with spectro- 
scopic redshifts (Fig. 0). The r.m.s. in A z , for z < 1.5, is 
~0.08. The inset plots in Fig. show that the spectroscopic 
sample includes faint galaxies up to the survey limiting mag- 
nitude and there is no evidence for a significant increase in 
photometric redshift errors for fainter galaxies. This error es- 
timate assumes that the HDFS spectroscopic identifications 
are all correct and therefore may in fact be pessimistic. 

In summary, we expect our photometric redshift errors 
to be small on the scale of the 0.5 bins in redshift that we 
adopt in of this paper, and small enough not to greatly 
affect the results of §[7|-pl (see also these sections for Monte 
Carlo simulations of the errors). 



5 STAR/GALAXY SEPARATION 

Separation of stars from galaxies is important if one hopes 
to compare the observations with model galaxy catalogues 
but it is even more important for accurately calculating the 
correlation function. Since stars are uncorrelated with galax- 
ies, any contamination from stars will reduce the correlation 
signal - the reduction is a factor of (1 — f) 2 where / is the 
stellar contamination fraction (Roche et al. 1997). There are 
two obvious approaches open to us. Firstly, SExtractor uses 
a neural net and the measured seeing FWHM to morpho- 
logically classify stars, outputting the parameter CLASS_STAR 
ranging from (galaxies) to 1 (stars). Separation is reason- 
ably clear-cut at bright magnitudes but less so towards the 
magnitude limit. Secondly we can use photometric template 
fitting. To do this we modified hyperz to accept 173 stellar 
templates from the Bruzual, Persson, Gunn & Stryker stellar 
atlasp| Objects are then classified photometrically accord- 
ing to the best-fitting template - either stellar or galactic. 
This method also suffers towards the magnitude limit. We 
much prefer the morphological approach: since particular 
galaxy SED/redshift combinations are photometrically con- 
fused with stars (and vice versa) more than other combina- 
tions (e.g. Fig. U^j, the use of photometric star/galaxy sep- 
aration would introduce excessive stellar contamination in 



II http:/ /www. stsci.edu/ftp/science/hdf/clearinghousc/ 
irim/iri m Jidf.html 
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© 20?? RAS, MNRAS 000, 




12 3 4 5 

spectroscopic redshift 
Figure 10. A plot of photometric versus spectroscopic redshifts 
in the HDFN. The spectroscopic redshifts come from the com- 
pilation of Fcrnandez-Soto et al. (2001; Table 2) omitting those 
galaxies with presumed incorrect spectroscopic redshifts as de- 
tailed in their Table 6. We use the photometry of Fcrnandez-Soto 
et al. (1999) but rcsticting to only the F300W, F450VK, F606W 
and F850W optical and H near-infrared filters (cf. UBVRIH in 
the LCIR survey). The r.m.s. in A 2 , where A 2 — Zspcc — -^photi 
is 0.14 for z < 1.5. We note that if the H filter is also omitted 
then the photometric redshift errors become substantially larger 
in this region. The increased scatter beyond z ~ 2 is largely due 
to the combination of wavelength range covered by the filter set 
and our decision not to include any very blue (starburst) tem- 
plates (cf. e.g. Massarotti et al. 2001). In this paper we are not 
concerned with galaxies beyond z ~ 1.5 so we prefer to use the 
simpler set of 4 CWW template SEDs. 



particular SED/redshift bins and lead to possibly severe in- 
accuracies in the correlation function. Admittedly compact 
galaxies could be lost along with stars in a morphological 
classification but, in this paper, we consider this the lesser 
of two evils. 

In fact in most cases there is good agreement be- 
tween the two methods, for example Fig. [L3 compares 
SExtractor's morphological classification in H with pho- 
tometric classification. However a small proportion of pho- 
tometrically classified stars have extended H images even 
at relatively bright magnitudes. Since most of the objects in 
our //-selected catalogue are well-detected in VRI, we adopt 
the criteria that an object is classified as a star if CLASS_STAR 
> 0.95 in any one of VRIH and the mean CLASS.STAR in 
those of these filters in which the object is detected is greater 
than 0.5. With these criteria the morphological and photo- 
metric classifications agree in 87 per cent of cases. Fig. |l4| 
shows the star and galaxy number counts as a function of 
H magnitude using the two classifiers. 

The combination of the two techniques allows us to 
roughly estimate the total population of stars and hence 
gauge the remaining stellar contamination or the number of 
galaxies discarded as stars. Using the H data, which have 
the best seeing (1 arcsec), we select a sample Gl of galaxies 
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spectroscopic redshift 

Figure 11. A plot of photometric versus spectroscopic redshifts 
in the HDFS. Only the LCIR survey UBVRIH data (i.e. the 
data used in the rest of this paper) has been used. The r.m.s. 
in Az, where A 2 = z S p CC — -^photi is 

0.08 for z < 1.5. The inset 
at lower right shows a histogram of the H magnitudes of these 
objects — all of which lie in the H < 20 region of the LCIR survey 
- showing that the spectroscopic sample includes a fair selection 
of faint galaxies. The inset at upper left plots the photometric 
redshift errors as a function of magnitude, showing that there is 
no indication of an increase in photometric redshift error to fainter 
magnitudes. In fact the largest photometric redshift error occurs 
for a galaxy with H = 17.7, which may indicate an incorrect 
spectroscopic redshift (and a consequent reduction in the true 
r.m.s. photometric redshift error). 



with CLASS_STAR < 0.10 in H, a sample SI of stars with 
CLASS_STAR > 0.90 in H and a sample S2 of stars with 
CLASS_STAR > 0.95 in H. These criteria leave many objects 
unclassified but we can be fairly confident that nearly all of 
the selected objects are classified correctly. Within the sam- 
ples Gl, SI and S2 we calculate the fraction that are pho- 
tometrically misidentified. Now, assuming that the fraction 
of photometric misidentifications is reasonably independent 
of morphology, we use these fractions to invert the number 
of photometrically classified stars and galaxies to estimate 
the actual number of stars and galaxies. The estimates us- 
ing the samples Gl, SI and Gl, S2 are plotted in Fig. |l5[ 
Also plotted are the star counts in SI and S2 and the star 
counts in the VRIH morphological classification that we ac- 
tually use (i.e. CLASS.STAR > 0.95 in any one of VRIH and 
the mean CLASS_STAR over these filters is greater than 0.5) 
which we label S3 in this section. Clearly SI and S2 miss 
many of the stars but S3 consistently exceeds the estimated 
total number of stars by about 15 per cent. 

We use S3 throughout this paper in order to err on 
the side of caution in removing stellar contamination but it 
must be remembered when comparing with models that the 
galaxy counts may be depleted at the 15 per cent level. Out 
of 5401 H < 20.0 objects we identify 2224 stars. We expect 
the remaining stellar contamination in our galaxy sample to 
be of the order of a few percent or fewer. After removing 
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Figure 12. On the right are shown the best-fitting photometric 
redshifts and galaxy spectral types to objects that are photomet- 
rically identified as stars. This illustrates that there are certain 
galaxy spectral types and redshifts with which stars are more 
likely to be confused (and vice versa). Using photometric tech- 
niques to separate stars may lead to excess stellar contamination 
in, for example, the sample of z < 0.7 spectral-type E galax- 
ies with a corresponding misestimate of their correlation func- 
tion. Using morphological separation techniques avoids this bias. 
On the left are shown the best-fitting photometric redshifts and 
galaxy spectral types to objects that are morphologically iden- 
tified as stars. This illustrates the actual SED/redshift distribu- 
tion of potential stellar contamination, if photometric separation 
techniques were used, based on the actual distribution of stel- 
lar spectral types in the observations. Since our photometric and 
morphological separation techniques agree in 87 per cent of cases 
the plots are in fact very similar. However to avoid any remain- 
ing bias we only use morphological star/galaxy separation in this 
paper. 



stars morphologically we then proceed to fit the remaining 
objects only to the grid of galaxy SEDs. 



6 GALAXY EVOLUTION MODELS 

In order to have a basis upon which to compare and interpret 
the observations, we have generated several model galaxy 
catalogues - a semi-analytic hierarchical merger model and 
several no-evolution and passive-evolution models. In this 
section we briefly describe these models. 



6.1 Semi-analytic hierarchical merger models 

The hierarchical paradigm of galaxy formation, set within 
the framework provided by the Cold Dark Matter theory 
of structure formation (e.g. Blumenthal et al. 1984), has 
proven to be very successful at reproducing many observed 
properties of galaxies. Testing and refining this paradigm is 
one of the major goals of modern observational cosmology. 



Figure 13. The H-band SExtractor star/galaxy classification 
parameter CLASS_STAR as a function of H magnitude for objects 
that are photometrically best-fitted by a stellar template (upper 
left) and objects that are photometrically best-fitted by a galaxy 
template (lower left). The panels on the right show histograms 
summed over magnitude. Even at bright H magnitudes a small 
proportion of photometrically-identified stars have extended mor- 
phologies. Hence our photometric star/galaxy separation is too 
unreliable to use for clustering analyses, where colour-dependent 
residual stellar contamination can differentially bias the cal- 
culated correlation function in different spectral-type/redshift- 
selected galaxy subsamples. We point out that this is not neces- 
sarily true of other surveys as photometric classification depends 
crucially on the particular filter set and S /N characteristics of the 
survey. To avoid this bias we use morphological techniques only 
to separate stars and galaxies in this paper. 
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Figure 14. The total number of objects identified (a) as galaxies 
and (b) as stars using (i) image morphology in VRIH and (ii) 
photometric template fitting. Error bars are Poisson. 
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Figure 15. Intrinsic star number counts (solid line and symbols) 
as a function of H magnitude in the HDFS field of the LCIR 
survey, as estimated from the number counts of photometrically- 
identified stars and galaxies and the estimated error rate in pho- 
tometric star/galaxy classification. Also plotted are the number 
of stars selected using the morphological criteria CLASS_STAR > 
0.90 in H (SI), CLASS.STAR > 0.95 in H (S2) and CLASS.STAR > 
0.95 in any one of VRIH (S3). The latter (long dashes) consis- 
tently exceeds the estimated intrinsic star counts by about 15 per 
cent. We use S3 in this paper in order to err on the cautious side 
in removing stellar contamination. 



Semi-analytic modelling is an attempt to use sim- 
ple recipes to parameterize the main physical processes 
of galaxy formation within the hierarchical paradigm (for 
an introduction see Kauffmann, White & Guiderdoni 1993; 
Cole et al. 1994; Somerville 1997; Kauffmann et al. 1999a; 
Somerville & Primack 1999 (SP); Cole et al. 2000 and ref- 
erences therein). Monte Carlo techniques can be used to ef- 
ficiently produce mock galaxy catalogues representing large 
volumes of space, and can be run with effectively arbitrary 
'mass resolution'. In addition to model spectra, magnitudes, 
and colours, this approach provides predictions of many 
physical properties of the galaxies, such as the distribution 
of stellar ages and metallicities, stellar and cold gas mass, 
bulge-to-disc ratio, etc. The free parameters of the mod- 
els are typically set by requiring that fundamental observed 
properties of nearby galaxies (such as luminosity functions, 
gas content, etc.) be reproduced at redshift zero. The nor- 
malized models can then be used to predict galaxy properties 
at any desired redshift. 

We use the current version of the code developed by 
Somerville (1997), which has been shown to produce good 
agreement with many properties of local (SP) and high- 
redshift (Somerville, Primack & Faber 2001; SPF) galax- 
ies. We now briefly summarize the main ingredients of the 
models, and specify how they differ from the previously pub- 
lished models of SP and SPF. 

The formation and merging of dark matter haloes as a 
function of time is represented by a 'merger tree', which we 
construct using the method of Somerville & Kolatt (1999). 
The number density of haloes of various masses at a given 
redshift is determined by an improved version of the Press- 
Schechter model (Sheth & Tormen 1999), which mostly cures 



the usual discrepancy with N-body simulations. The cooling 
of gas, formation of stars, and reheating and ejection of gas 
by supernovae within these haloes is modelled by simple 
recipes. Chemical evolution is traced assuming a constant 
yield of metals per unit mass of new stars formed. Metals 
are cycled through the cold and hot gas phase by cooling 
and feedback, and the stellar metallicity of each generation 
of stars is determined by the metal content of the cold gas at 
the time of its formation. All cold gas is assumed to initially 
cool into, and form stars within, a rotationally supported 
disc; major mergers between galaxies destroy the discs and 
create spheroids. New discs may then be formed by subse- 
quent cooling and star formation, producing galaxies with a 
range of bulge-to-disc ratios. Galaxy mergers also produce 
bursts of star formation, according to the prescription de- 
scribed in SPF. Thus the star formation history of a single 
galaxy is typically quite complex and is a direct consequence 
of its gas accretion and merger history and its environment. 

These star formation histories are convolved with stellar 
population models to produce model spectra or to calculate 
magnitudes and colours. We have used the multi-metallicity 
stellar population synthesis models of Devriendt, Guider- 
doni & Sadat (1999) with a Salpeter IMF to calculate the 
stellar part of the spectra. The effect of dust extinction is 
modelled using an approach similar to that of Guiderdoni 
& Rocca-Volmerange (1987) and Devriendt & Guiderdoni 
(2000). Here, the face-on optical depth t\ of a galactic disc 
is assumed to be proportional to the column density of met- 
als in the cold ISM: 



t\ = 



\A v J Zq \Zq J 



(Nh) 



2.1 x 10 21 cm 2 



where the mean column density of cold gas is 



{Nb) = 



AL 



(2) 



(3) 



The gas truncation radius r t is taken to be 3.5 times the 
modelled exponential disc scale-length. This produces an av- 
erage B-band face-on optical depth of tb = 0.8 for L, spi- 
ral galaxies, in agreement with observations (e.g. Wang & 
Heckman 1996). The quantity A\/Ay is the solar metallicity 
extinction curve, which we take to be the standard Galactic 
extinction curve given by Cardelli, Clayton & Mathis (1989). 
The power-law scaling with the gas metallicity is intended 
to account for the metallicity dependence of the shape of 
the extinction curve (see Guiderdoni & Rocca-Volmerange 
1987). We assign random inclinations to the galaxies and 
use a 'slab' model to compute the extinction as a function 
of inclination (see Somerville 1997 or Somerville et al. 2001b 
for details). Note that spheroids are assumed to be dust-free. 

As described in SP, we set the free parameters of the 
models by reference to a subset of local galaxy data; in par- 
ticular, we require a typical L, galaxy to obey the observed 
/-band Tully-Fisher relation and to have a gas fraction of 
0.1 to 0.2, consistent with observed gas contents of local 
spiral galaxies. If we assume that mergers with mass ratios 
greater than ~ 1:3 form spheroids, we find that the models 
produce the correct morphological mix of spirals, SOs and 
ellipticals at the present day (we use the mapping between 
bulge-to-disc ratio and morphological type from Simien & 
de Vaucouleurs 1986). This critical value for spheroid for- 
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mation is what is predicted by N-body simulations of disc 
collisions (cf. Barnes & Hernquist 1992). 

In this paper, we assume the currently favoured ACDM 
cosmology with Qo = 0.3, S7a = 0.7, and Hq = 70 km 
s _1 Mpc -1 . It was shown in SP that with the proper 
choice of values for the parameters controlling the star 
formation, feedback, and chemical yield, this model pro- 
duces reasonable agreement with the observed B-band 
and if-band luminosity functions, Tully-Fisher relation, 
metallicity-luminosity relation, and optical colours of local 
galaxies. We use the same fiducial model here, with a few mi- 
nor modifications: we incorporate self-consistently the mod- 
elled metallicity of the hot gas in the cooling function, and 
use the multi-metallicity SEDs (instead of solar metallicity) 
with a Salpeter (instead of Scalo) IMF. Another minor detail 
is that material ejected by supernovae feedback is eventually 
returned to the haloes as described in the updated models 
of SPF. We find that these minor modifications do not sig- 
nificantly change our previous results for local galaxies. An 
updated set of predictions for local galaxy luminosity func- 
tions and galaxy counts will be presented in Somerville et 
al. (2001b). 

We note that the model is in no way tuned to fit the 
results of the LCIR survey. 

6.2 No-evolution and passive-evolution models 

While the semi-analytic model is a physically-motivated at- 
tempt to trace the initial conditions following the Big Bang 
to the galaxy population observed today, a more traditional 
approach (e.g. Tinsley 1977; King & Ellis 1985; Yoshii & 
Takahara 1988; Pozzetti, Bruzual & Zamorani 1996 (PBZ); 
Gardner 1998 (G98)) has been to take the z — galaxy pop- 
ulation and extrapolate it backwards in time. These are com- 
monly referred to as no-evolution (where the galaxy popula- 
tion remains unchanged with redshift) and passive-evolution 
or pure luminosity evolution (PLE) (where galaxies are as- 
signed some formation redshift and star formation history 
and their spectral energy distributions evolve accordingly). 
Such models have been used to model galaxy number counts 
and have led to, for example, the identification of the so- 
called faint blue galaxy problem - where it was noted that 
the observed B-band number counts displayed an excess of 
faint blue galaxies relative to no-evolution models but could 
be fitted by PLE models. These PLE models subsequently 
failed to fit the observed redshift distribution of these galax- 
ies which led PLE proponents to make various refinements 
such as non- number conservation (merging), fading dwarf 
galaxies, dust modelling and new cosmologies. While recent 
PLE models have been successful at fitting optical counts 
and redshift distributions (PBZ; but see also Pozzetti et al. 
1998) there have been discrepancies in the near-IR redshift 
distributions (PBZ; Kauffmann & Chariot 1998) and a no- 
evolution model has often produced a better fit to near-IR 
number counts (PBZ; Metcalfe et al. 1995; Chen et al. 2001; 
but see also Totani 2001a). Such models have also been com- 
pared with ERO number counts (Zepf 1997; Daddi, Cimatti 
& Renzini 2000b). It is useful therefore to reassess the abil- 
ity of these models to fit the near-IR data using the number 
counts, ERO counts, and photometric redshift distributions 
of the large data set presented in this paper. 

Following PBZ, G98 and others, we take a local lumi- 



nosity function (LF) divided into spectral types. For each 
spectral type, the LF and our adopted fio = 0.3, Qa = 0.7, 
Ho — 70 km s" 1 Mpc -1 cosmology give the galaxy distri- 
bution function dN = N{z,Mp)dzdMF where Mf is the 
rest-frame absolute magnitude in the filter used to define 
the LF. Fitting a particular SED to each spectral type, we 
use hyperz software (BMP) to calculate the observed magni- 
tude in each survey filter for a galaxy of given redshift and 
absolute magnitude Mf- Integrating dN over z gives the 
number counts as a function of magnitude in each filter. In- 
tegrating over Mf up to the faint limit imposed at each z by 
the limiting magnitude in the survey filter gives the redshift 
distribution for galaxies brighter than the survey limit. The 
normalization of the number counts comes directly from the 
LFs' (j) t rather than normalizing to observed galaxy number 
counts in some apparent magnitude range (cf. PBZ). 

We used the Bj-band LF of Madgwick et al. (2001 
(M01); see also Folkes et al. 1999) based on 75000 galax- 
ies in the 2dF Galaxy Redshift SurveyP], This LF has been 
divided into 4 spectral types based on a principal compo- 
nent analysis of the galaxy spectra. These spectral types 
(1-4) correspond approximately to the morphological types 
E/SO/Sa, Sb, Scd and Scd (again) of Kennicutt (1992). For 
our no-evolution model we used the same extended CWW 
SEDs described in §§, fitting CWW E to type 1, CWW 
Sbc to type 2, CWW Scd to type 3 and CWW Im to 
type 4. In this model the SEDs are fixed with respect to 
redshift. For our passive-evolution model the SEDs evolve 
with redshift. Here we used Bruzual & Chariot (1993; BC) 
GISSEL'98 models with Miller & Scalo (1979) initial mass 
function and solar metallicity. We fitted the M01 types 2 
and 3 with BC models with exponentially decaying star for- 
mation rates (SFR) with time-scales r = 5 (Sb) and 15 (Sc) 
Gyr respectively and we fitted the M01 type 4 to a BC 
model with continuous star formation (Im). As we are par- 
ticularly interested in the evolution of red galaxies in this 
paper, we used two alternative models for the M01 type 1 
- one with r = 1 Gyr (E) and the second being a single 
stellar population (SSP) burst (B). We used a formation 
redshift Zf = 10 for all these models, and to improve the fit 
of these spectra to empirical spectra at z = 0, dust redden- 
ing modelled by a uniform dust screen following a Calzetti 
et al. (2000 (CABKKS); also Calzetti, Kinney & Storchi- 
Bergmann 1994) extinction law with Ay = 0.3 was applied 
to all of the synthetic spectra. The passive-evolution model 
(using the SSP burst for type 1) provides a good fit to the 
compilation of deep B-band counts in Metcalfe et al. (1995). 

Since using the B./-band M01 LFs to model the num- 
ber counts and redshift distributions in an //-selected survey 
involves extrapolating over a factor of four in wavelength, 
we generated further no-evolution and passive-evolution 
models based on the Tf-band LF of Gardner et al. (1997 
(G 97); see also Gardner's own number count model ncmod 



at http://hires.gsfc.nasa. gov. /~gardner/ncmod/). This LF 
is not divided into spectral or morphological types but fol- 
lowing G98 we divide the normalization 0* between five 
spectral types in the ratio 0.51:0.36:0.1:0.03:0.004. In the 
no-evolution model we match these five types to the CWW 
types E, E, Sbc, Scd and Im respectively and in the passive- 
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evolution model we match the five types to the BC types E 
or B, Sa, Sb, Sc and Im as described above, where Sa is an 
extra type with exponentially decaying star formation rate 
with time-scale r = 3 Gyr. The k- and e- corrections are in 
any case much less dependent on spectral type when con- 
verting from rest-frame K magnitudes to observed-frame H 
magnitudes than when converting from the £>./-band so the 
results (at least the total //-band number counts) are not 
very sensitive to the exact spectral type mix. In the follow- 
ing section we find that the models based on the /("-band LF 
generally provide a better fit to the observations than those 
based on the _Bj-band LFs. 



7 COMPARISONS BETWEEN 
OBSERVATIONS AND MODELS 

We begin (§|^.l) by comparing colour-colour and colour- 
magnitude diagrams between the observations and the vari- 
ous models. This is a useful way to visualize the data - show- 
ing the relative positions of different populations and the 
effect of redshift and reddening. In §[?].2 we compare number 
counts, redshift distributions and ERO statistics and dis- 
cuss how these are consistent with or contradict the various 
models. 

As rioted in §^ we have generally used aperture mag- 
nitudes for the observations in this paper. This complicates 
direct comparison with models, which generate total mag- 
nitudes. Ideally one would like to compare the observations 
with simulated images generated by adding sky and photon 
noise to simulated galaxies, however our semi-analytic model 
does not produce a scale-length for the bulge component nor 
does our passive-evolution model include a formalism for the 
evolution of scale-length so we are unable to properly take 
this approach at present (cf. Totani et al. 2001a). Again 
as noted in §^ the difference between total magnitudes and 
seeing-corrected 3 arcsec aperture magnitudes is expected 
to be of the order of 0.1-0.3 magnitudes at the faint limit 
of this survey. In most of this section we account for this 
by comparing both H < 20.0 and H < 20.5 selected data 
catalogues with H < 20.0 selected model catalogues. The 
real comparison is somewhere between the two. We make an 
exception for our number counts comparison, where for each 
object we use the brightest of isophotal and seeing-corrected 
3 arcsec aperture magnitudes since here the increase of the 
aperture effect towards brighter magnitudes (objects that, 
on average, have greater angular extent) becomes important 
while at faint magnitudes isophotal magnitudes will under- 
estimate fluxes. 

For further analyses of the multicolour catalogue - in- 
cluding other LCIR survey fields - see Chen et al. (2001) 
and Marzke et al. (in preparation). 

7.1 Colour distributions 

We begin by comparing colour-colour and colour-magnitude 
diagrams. Fig. [l(] and Fig. ^compare respectively BVI and 
VRH colours for H < 20.0 LCIR survey galaxies with semi- 
analytic model galaxies and LCIR survey stars with stars 
from the Bruzual, Persson, Gunn & Stryker (BPGS) stel- 
lar atlas and with Kurucz (1993) stellar atmosphere mod- 
elsf^kampled through the survey filters. The Kurucz mod- 
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Figure 16. A comparison of BVI colours of LCIR survey galax- 
ies with the semi-analytic model and of LCIR survey stars with 
stars from the Bruzual, Persson, Gunn & Stryker (BPGS) stellar 
atlas and Kurucz stellar atmosphere models. Both LCIR survey 
and semi-analytic model have been cut at H = 20.0. The Ku- 
rucz models cover a range of temperatures from 3500 to 50000 K, 
metallicities \og(Z / Zq) = 1, and —5, and surface gravities logg 
= 0, 2.5 and 5. The number of points in the semi-analytic model 
has been scaled to match the observations. 

els cover a range of temperatures - 3500, 4500, 6000, 9000, 
12000, 20000 and 50000 K, metallicities - log(Z/Z ) = 1, 
and —5, and surface gravities - logy = 0, 2.5 and 5. Agree- 
ment between the stellar tracks provides a check on the data 
zeropoint calibration. The agreement between the observa- 
tions and the semi-analytic model is fair over the main body 
of the distribution, but it is noted firstly that there is con- 
siderable scatter in the observations (the photometric errors 
are mostly < 0.2 magnitudes in RIH but up to 0.4 magni- 
tudes in BV) and secondly that the observations include a 
component of red and extremely red objects not present in 
the semi-analytic model. In addition, there is much greater 
scatter in the V — R colours of red R — H objects in the ob- 
servations than in the semi-analytic model. In Fig. [l8|, which 
compares V — H, R — H and I — H colours as a function of H 
magnitude for LCIR survey stars and galaxies and galaxies 
in the semi-analytic model, the extremely red R — H > 4 
and I — H > 3 galaxies are very evident in the observations 
but are largely absent from the semi-analytic model. In Fig. 
[is) and Fig. ^ we compare the positions in BVI and VRH 
colour-colour space of different photometrically-determined 
spectral types in the catalogue showing that the red objects 
are mostly fitted by the E template, even those with quite 
blue B — V and V — R colours. 

In order to interpret the colour-colour diagrams it is in- 
structive to plot as a function of redshift the tracks of various 
template SEDs from the no-evolution and passive-evolution 
models described in §^.2. Fig. pl| and Fig. ^ show BVI and 
VRH diagrams of observed galaxies with (a) the tracks of 
four CWW SEDs (E, Sbc, Scd and Im) over-plotted, (b) the 
tracks of six BC SEDs over-plotted, specifically one SED 



§§ See STSDAS: |ittp://ra.stsci.cdu/About.h~ 
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Figure 17. A comparison of VRH colours of LCIR survey galax- 
ies with the semi-analytic model and of LCIR survey stars with 
stars from the Bruzual, Persson, Gunn & Stryker (BPGS) stellar 
atlas and Kurucz stellar atmosphere models. Both LCIR survey 
and semi-analytic model have been cut at H = 20.0. The Ku- 
rucz models cover a range of temperatures from 3500 to 50000 K, 
metallicities \og(Z / Zq) = 1, and —5, and surface gravities logg 
= 0, 2.5 and 5. The number of points in the semi-analytic model 
has been scaled to match the observations. 
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Figure 18. Comparison of V — H, R — H and I — H colours as 
a function of H magnitude for LCIR survey stars, LCIR survey 
galaxies and galaxies from the semi-analytic model. The number 
of points in the semi-analytic model has been scaled to match the 
observations. 



with constant SFR, one SED with an SSP burst and four 
SEDs with exponentially decaying SFRs with time constants 
r = 1, 3, 5 and 15 Gyr, assuming a fixed redshift of formation 
Zf = 10 and the cosmology firj = 0.3, Qa = 0.7, Ho — 70 km 
s _1 Mpc -1 and (c) the same BC SEDs but with an firj — 1, 
JIa = cosmology. The latter tracks evolve more rapidly to 
higher redshift as there is less time in an Qo = 1 universe. 
Unit redshift intervals are labelled. These diagrams illustrate 
that a possible interpretation of the extremely red objects 
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Figure 19. BVI colours for H < 20.0 LCIR survey galaxies for 
different photometrically-determined spectral types. 



Figure 20. VRH colours for H < 20.0 LCIR survey galaxies for 
different photometrically-determined spectral types. 



are z ~ 1 CWW E or BC burst models while the red objects 
with bluer B — V and V — R colours could be the passively 
evolving progenitors of red galaxies seen at z ~ 1-2 with 
some residual star formation (somewhere between BC SSP 
burst and r = 1 Gyr models). The addition of some dust 
reddening to the BC models (as indicated by the arrows) 
is probably necessary to match observed galaxies. Adding 
large amounts of dust reddening (Av 3> ~2, depending on 
the reddening law adopted) could move later spectral types 
into the ERO region of colour-colour space. The blue tail 
seen in the semi-analytic model in the BVI diagram corre- 
sponds to 2 ~ 1.5-2 blue spectral types - galaxies which are 
missing, or redder, in the observations. 

The only consequences of the aperture effect in these 
plots is the inclusion of extra objects in the semi-analytic 
model around the H — 20.0 limit, and a small shift in the 
colour-magnitude plots. 
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Figure 21. BVI tracks of (a) the four empirical CWW SEDs: 
E, Sbc, Scd and Im, and (b, c) six BC GISSEL'98 SEDs (single 
stellar population (SSP) burst, exponentially decaying star for- 
mation with time-scale r = 1, 3, 5 and 15 Gyr and constant star 
formation rate (SFR), all with formation redshift zj = 10) in two 
cosmologies plotted on the LCIR survey galaxies. Unit redshift in- 
tervals are marked and the effect of applying Ay = 1.0 reddening 
(CABKKS) is indicated by arrows. At Z ~ 1 the SED towards 
the right corresponds to the E or SSP burst galaxy while that 
towards the left corresponds to the Im or constant SFR galaxy. 
Note that above z ~ 1.5 the CWW E SED becomes increasingly 
unrealistic as there is limited time for such a galaxy to form. 



Figure 22. VRH tracks of (a) the four empirical CWW SEDs: 
E, Sbc, Scd and Im, and (b, c) six BC GISSEL'98 SEDs (single 
stellar population (SSP) burst, exponentially decaying star for- 
mation with time-scale r = 1, 3, 5 and 15 Gyr and constant star 
formation rate (SFR), all with formation redshift zt = 10) in two 
cosmologies plotted on the LCIR survey galaxies. Unit redshift in- 
tervals are marked and the effect of applying Ay = 1.0 reddening 
(CABKKS) is indicated by arrows. At z ~ 1 the SED towards 
the right corresponds to the E or SSP burst galaxy while that 
towards the left corresponds to the Im or constant SFR galaxy. 
Note that above z ~ 1.5 the CWW E SED becomes increasingly 
unrealistic as there is limited time for such a galaxy to form. 
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7.2 Number counts and redshift distributions 

While the previous subsection was a general overview of the 
data, in this subsection we compare specific predictions of 
the models with a view to determining which provide the 
closest match to the observations, and in order to iden- 
tify possible explanations for any discrepancies. A generic 
prediction of hierarchical structure formation is that mas- 
sive elliptical galaxies form from the merging of smaller disc 
galaxies and hence formed relatively recently (White & Rees 
1978). In contrast, the traditional passive-evolution model 
has massive elliptical galaxies forming in single monolithic 
collapses at high redshift and then evolving passively to the 
present day with no further star formation (Eggen, Lynden- 
Bell & Sandage 1962). Thus, as Kauffmann & Chariot (1998) 
point out, the number density of massive galaxies at z ~ 1 
(z > 1.5 for ACDM) can provide a sensitive test to differen- 
tiate between the two models. In particular a near-IR survey 
such as this is crucial since infrared light is a much more ro- 
bust tracer of total stellar mass than optical light out to z ~ 
1-2. Kauffmann & Chariot (1998) compared the redshift dis- 
tributions of the K < 18 sample of Songaila et al. (1994) and 
the K < 19 sample of Cowie et al. (1996) with predictions 
of hierarchical merger models and pure luminosity evolution 
(PLE) models, finding that the PLE model greatly overes- 
timated the z > 1 cumulative counts while the hierarchical 
merger model provided a much better fit to the observations. 
However the sample sizes were small (~ 170 galaxies in all), 
may suffer from spectroscopic incompleteness at z ~ 1, and 
in any case it is useful to repeat the test with a currently 
more favoured cosmology. 

Since a large fraction of EROs are thought to be z ~ 1 
massive evolved galaxies, the relative efficiency with which 
they may be isolated using just one optical and one near- 
IR colour, led several groups to use the number densities 
of EROs as a test for various models, sometimes with the 
conclusion that PLE models greatly overpredict the num- 
ber densities of EROs (Zepf 1997; Barger et al. 1999) and 
sometimes with the conclusion that PLE models are in good 
agreement with the observations (Daddi et al. 2000b; DCR). 
One outcome of these surveys was the result that the ERO 
population is very strongly clustered so that field-to-field 
variations lead to the requirement of wide-field surveys in 
order to get good estimates of the number density of these 
objects (Daddi et al. 2000a (D00); McCarthy et al. 2000). 
In particular, the observations presented here cover a com- 
parable area at a comparable depth (using H — K ~ 1) to 
those presented by D00, while the addition of our photomet- 
ric redshifts allows us to estimate N(z) for the EROs and 
in particular to isolate z ~ 1-1.5 galaxies of any spectral 
type, thus allowing more detailed comparisons with models. 
From the observed clustering amplitude, we expect that our 
estimated ERO number counts are representative to within 
15 per cent for R— H > 4 and I — H > 3 samples (see §^|). 
We begin by comparing H number counts N(M) in Fig. 
(also Table Q). Here, in contrast to the rest of the paper, 
we use for each object the brightest of its isophotal mag- 
nitude corrected to total magnitudes by assuming a gaus- 
sian profile (SExtractor's MAG_IS0C0R parameter) which we 
found to be a good measure of total magnitude at the 
bright end, and its seeing-corrected 3 arcsec aperture mag- 
nitude. We do this because the aperture effect varies with 



Table 2. Galaxy number counts as a function of H magnitude 
in the HDFS field of the LCIR survey (see also McCarthy et al. 
2001 and Chen et al. 2001). The quoted errors are Poisson. The 
slight turnover in the last magnitude bin is artificial (see caption 
to Fig. and text for details). 



H magnitude ./V/mag/deg 2 



15.25 


110±40 


15.75 


210±60 


16.25 


370±80 


16.75 


740±110 


17.25 


1330±150 


17.75 


2230±200 


18.25 


4100±300 


18.75 


5400±300 


19.25 


8700±400 


19.75 


11300±500 


20.25 


12200±500 



magnitude, being larger for close bright extended galaxies. 
This is unimportant for a simple H < 20.0 sample but is 
important for number counts as a function of magnitude. 
Conversely, just using isophotal magnitudes would severely 
underestimate the flux of the faintest objects. Our galaxy 
number counts agree fairly well with those of McCarthy 
et al. (2001) from 0.62 degrees 2 and the deep NICMOS 
counts of Yan et al. (1998). The faint end (18 < H < 20) 
slope, dlog N(H)/dH , is 0.38. This compares with 0.31 for 
H > 20.0 (Yan et al. 1998) and 0.47 for H < 19 (Martini 
2001a). The no-evolution models fit the observed number 
counts very well. The passive-evolution models based on the 
Bj-band luminosity function (LF) do not fit the observa- 
tions - possibly due to the uncertainties involved in extrap- 
olating from rest-frame Bj-band magnitudes to observed- 
frame i/-band with model SEDs that are only approximate 
matches to the spectral types identified by M01. Extrapo- 
lating from the K-band is more robust and less dependent 
on the details of the model SEDs. However, despite better 
agreement at bright magnitudes, the slope at H ~ 20 in 
the if-band LF passive-evolution models is too steep com- 
pared with the observations (but cf. Totani et al. 2001a). 
The semi-analytic model matches the slope fairly well, ex- 
cept in the faintest magnitude bins, but the normalization is 
about 50 per cent too high. As noted in §[], ~15 per cent of 
this difference could be due to galaxies discarded from the 
LCIR survey as stars. The remainder may indicate a prob- 
lem with the semi-analytic model normalization since it is 
present even in the lowest photometric redshift bin (Fig. |24| 
next paragraph) . Correct normalization has been a common 
problem with number counts models as 0* for local LFs is 
often uncertain to within a factor of ~2 - therefore a com- 
mon practice elsewhere has been to renormalize the models 
to observed counts at relatively bright magnitudes. 

A more sensitive discriminant between models is the 
redshift distribution - as inferred from photometric redshifts 
here - and in particular the redshift distribution of different 
spectral types. In Fig. ^ we compare the redshift distri- 
bution N(z) of the observations with the various models. 
As noted in the introduction to this section we plot both 
H < 20.0 and H < 20.5 observed samples so that if we 
were able to correct for the aperture effect then the result- 
ing N(z) would lie somewhere between the two. We estimate 
errors by bootstrap-resampling the whole galaxy catalogue. 
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Figure 23. Galaxy number counts as a function of H magnitude 
in the HDFS field of the LCIR survey (see McCarthy et al. 2001 
and Chen et al. 2001 for deeper completeness-corrected counts 
from the full LCIR survey) . For each galaxy we use the brightest 
of its corrected isophotal and corrected aperture magnitudes (as 
explained in the text). This results in the slight artificial turnover 
in the last magnitude bin. Also note that the galaxy counts are 
expected to be depleted at the 15 per cent level due to the conser- 
vative star/galaxy separation criteria employed. Also plotted are 
the number counts of McCarthy et al. (2001) from 0.62 degrees 2 
and the deep NICMOS counts of Yan et al. (1998). Several mod- 
els are also plotted (see for details). The models based on the 
A"-band LF are a closer match to the H-band observations than 
those based on the Bj-band LF, and the observed counts are 
most consistent with the no-evolution model. The semi-analytic 
model follows the observed slope except at the faintest magni- 
tudes but is normalized ~50 per cent too high. Also plotted are 
EROs satisfying R - H > 4. These are well-fitted by the E/S0 
galaxies with Ft — H > 4 from the no-evolution model (plotted) 
but the semi-analytic model has far too few R — H > 4 galaxies 
compared with the observations. 



Then for each galaxy we use the S/N characteristics of the 
survey to add random photometric errors in each filter. Pho- 
tometric redshifts and spectral types were then recalculated 
for these galaxies and the new redshift distributions were 
calculated. This was repeated 60 times and error bars were 
estimated using the central 90 per cent percentile. Fig. ^ 
illustrates the same data as cumulative fractions in order to 
factor out the normalization. Apart from the normalization 
of the semi-analytic model, it and the no-evolution model are 
in broad agreement with the observations, the no-evolution 
model perhaps lacking galaxies in the highest (z ~ 1-1.5) 
redshift bins and the semi-analytic model predicting an ex- 
cess in the lowest redshift bin. The latter may be due to an 
excess of bright galaxies in the z = K-band luminosity 
function in the semi-analytic model, as demonstrated in SP. 
The similarity between the no-evolution and semi-analytic 
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Figure 24. The photometric redshift distribution of LCIR sur- 
vey galaxies (upper left) and the corresponding redshift distribu- 
tion of galaxies in several model catalogues: semi-analytic merger 
model (upper right), no-evolution model (lower left) and passive- 
evolution with two models for evolving z = E/S0 spectral types 
(lower right). In the latter, the bold curves are calculated us- 
ing the A"-band LF while the thin curves use the Sj-band LF; 
the observations are replotted as histograms to facilitate compar- 
ison. Error bars are 90 per cent percentiles from bootstrap re- 
sampling followed by Monte Carlo simulations in which random 
photometric errors were added to the observed photometry and 
photometric redshifts were recalculated. As noted in the text the 
observations are expected to underestimate the counts slightly 
due to some galaxies being mistakenly removed as stars. The 
semi-analytic and no-evolution models are plausible fits to the 
observations but the passive-evolution models predict too many 
high-redshift galaxies. 



model predictions over this redshift range results from a 
'conspiracy' effect for bright galaxies in the semi-analytic 
model: although the galaxy stellar mass function evolves sig- 
nificantly from z — 1.5 to z = 0, this is offset by evolution 
in the mass-to-light ratio which increases, even in the near- 
IR, as stellar populations age and become more metal rich, 
resulting in little evolution in the luminous component of 
bright galaxies. The passive-evolution models based on the 
Bj-band LF predict far too many z > 0.75 galaxies, while 
that based on the A"-band LF with the E/S0 population 
modelled by high-redshift SSP bursts provides a closer fit 
to the observations though the median redshift is still too 
high. In the remainder of this section we consider only the 
no-evolution and passive-evolution models based on the K- 
band LF as those based on the Bj-band LF are a poorer fit 
to the infrared observations. 

In Fig. |26| we plot the redshift distributions broken down 
into spectral types. For the no-evolution model we take the 
redshift distributions estimated from the G97 K-ha,nA LF 
matched respectively with the CWW E, Sbc, Scd and Im 
SEDs, with the resulting N(z) distributions renormalized 
to the total observed counts for each corresponding spec- 
tral type. We do this because the divisions between the 
galaxy types in G98 do not correspond exactly to those im- 
posed on the observations by fitting the four CWW template 
SEDs, and in any case G98 identifies six types while we di- 
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Figure 25. The cumulative redshift distributions of galaxies in 
the LCIR survey and several models. The SSP burst SED used in 
one of the passive-evolution models is very bright at its formation 
redshift zt, so with Zj = 10 as adopted here, it is bright enough 
to introduce a population of z ~ 10 galaxies into the sample. 
This can be prevented by choosing a higher Zj (e.g. zt ~ 30), 
lengthening the duration of the star-forming burst or by invoking 
dust obscuration. Thus we also plot the cumulative distribution of 
this model with the z ~ 10 population removed (B (hidden) +...). 
The semi-analytic model provides the best fit to the observations. 

vide the LCIR survey galaxies into only four types. For the 
semi-analytic and passive-evolution models, where the SEDs 
evolve with redshift so that a spectral type may look like the 
CWW E SED at z = but look like the CWW Scd SED 
at z ~ 2 say, we instead fit the best CWW spectral type 
to each galaxy using the observed-frame colours - i.e. in a 
manner analogous to calculating photometric redshifts for 
the observations except without allowing z to vary for any 
given galaxy. 

In the LCIR survey, spectral-type E galaxies are the 
most common spectral type at z < 0.75 but at higher red- 
shifts they begin to become less dominant relative to later 
spectral types and drop out by z ~ 1.25. The no-evolution 
model agrees fairly well with observations but underpredicts 
the number of spectral-type Sbc galaxies at z > 1 - consis- 
tent with some luminosity evolution or merging in the ob- 
servations. In contrast the semi-analytic model greatly un- 
derpredicts the fraction of spectral-type E galaxies, except 
perhaps in the lowest (z < 0.25) redshift bin, while overpre- 
dicting the numbers of spectral-type Scd and Im galaxies 
especially towards higher redshifts. This discrepancy could 
be explained by excess star formation in the semi-analytic 
model, for example from too frequent merging with the re- 
sulting bursts of star formation being over- prolonged; or if 
a large fraction of the observed z ~ 1 spectral-type E galax- 
ies are in clusters then some process such as ram pressure 
stripping or the hot cluster environment preventing gas from 
cooling could act to inhibit star formation. Some of these 
processes may not be modelled properly in the semi-analytic 
model. 

Our passive-evolution model using an SSP burst model 
for evolving local E/S0 galaxies predicts too many spectral- 
type E galaxies at z > 0.75. If instead we use the r = 1 Gyr 



Figure 26. The redshift distributions of different CWW spec- 
tral types in the observations (H < 20.0 - solid line; H < 20.5 
- dashed line), the semi-analytic merger model, the no-evolution 
model (using the _R"-band LF and with the observations plotted 
as histograms to facilitate comparison) and two passive-evolution 
models (using a single stellar population burst (dotted line) and 
a model with exponentially decaying star formation rate with 
time-scale 1 Gyr (dashed line) respectively, both with formation 
redshift Zf = 10, to model the E/S0 population). Error bars 
are 90 per cent percentiles from bootstrap resampling followed 
by Monte Carlo simulations in which random photometric errors 
were added to the observed photometry and photometric redshifts 
and spectral-types were recalculated. The sharp boundaries in 
the passive-evolution models as BC SEDs evolve from one CWW 
spectral type to another are a consequence of modelling galaxy 
populations with discrete spectra instead of a continuous range. 
Compared with the observations, the semi-analytic model pre- 
dicts an excess of bluer spectral types relative to redder spectral 
types at z > 0.5. 



model we get better agreement for spectral-type E galaxies 
but evolving these galaxies back gives rise to an excess of 
Sbc types at 0.75 < z < 1.5 and Scd/Im types at z > 
1.5. We note that there is wide scope for variation of the 
passive-evolution models - choice of formation redshift Zf, 
star formation history, modelling of dust, LF, initial mass 
function, cosmology and so on. In particular, agreement with 
the observations could be improved if high-redshift bursts 
of star-formation are heavily obscured by dust (Smail et al. 
1999; Blain et al. 1999a). 

The photometric redshift distributions of the EROs are 
useful for interpreting their angular clustering (§|§| §^) and 
surface number density. In this paper we define EROs by 
several colour cuts - R - H > 4, R - H > 4.5, I - H > 3 
and I — H > 3.5. Taking typical colours for re d g alaxies 
at 2 ~ 1 of R - I ~ 1.5 and H - K ~ 1 (Fig. Q, these 
colour cuts correspond to R— K colours of roughly 5, 5.5, 5.5 
and 6 respectively. It is worth noting that these colours are 
less extreme than those used by many authors - e.g. Smail 
et al. (1999) and Thompson et al. (1999) use R - K > 6 
(see also Totani et al. 2001b) - but are similar to those 
used by Daddi et al. (2000a). The evidence suggests that 
the reddest EROs are dusty starbursts while less red ERO 
samples, such as ours, are dominated by 'ordinary' galaxies 
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with evolved stellar populations (Smail et al. 1999; Daddi 
et al. 2000b) . Furthermore, since the filter pairs R — H and 
I — H are closer together than the more traditional R — 
K, they are less sensitive to the smoother break in dusty 
galaxies compared with a strong 4000 A break in z ~ 1 
evolved elliptical galaxies, and the I — H cut is also more 
sensitive to evolved galaxies that, due to small amounts of 
residual star formation - as might follow late merger events 
- would be missed from an R — K colour cut. 

In Fig. H we plot N(z) for EROs with R - H > 4 
and R - H > 4.5 and in Fig. || we plot N(z) for EROs 
with I - H > 3 and EROs with I - H > 3.5. The me- 
dian redshifts are typically 1—1.2, with the redder samples 
being at the higher redshifts, and the redshift range is rel- 
atively restricted, especially for the redder samples, thus 
providing one explanation for the strong angular clustering 
observed for these objects. The observed EROs are not all 
best-fitted by the CWW E SED. Instead about 70 per cent 
are spectral-type E while about 30 per cent are spectral-type 
Sbc. The median redshift of the Sbc group is ~ 0.3 greater 
than the median redshift of the E group, consistent with 
a small amount of residual star formation besides a strong 
4000 A break in massive systems at these slightly higher red- 
shifts (cf. Franceschini et al. 1998). Taking a typical H — K 
colour of 1 (Fig. p9| ), our number density of 0.5 arcmin -2 
(H < 20.0; 744 arcmin 2 ) to 1 arcmin -2 (H < 20.5; 407 
arcmin 2 ) of EROs with R — H > 4 agrees well with the 
number density of 0.5 arcmin -2 of EROs with R — K s > 5 
and K a < 19 of D00 (701 arcmin 2 ). At R - H > 5 and 
H < 20 we find 0.06 arcmin -2 compared with 0.04 arcmin -2 
(Thompson et al. 1999; 154 arcmin 2 ) and 0.07 arcmin -2 
(D00) for R - K > 6 and K < 19 and at R - H > 4.3 and 
H < 19 we find 0.03 arcmin -2 compared with 0.05 arcmin -2 
(Martini 2001b; 185 arcmin 2 ) and 0.03 arcmin -2 (D00) for 
R - K > 5.3 and K < 18. 

The number density of EROs can provide a sensitive 
constraint on models (Zepf 1997; DCR) being essentially 
the same test as the number density of z ~ 1 spectral-type 
E galaxies. We have listed the ERO fractions in the observa- 
tions and models for various red colour cuts in Table ^. The 
no-evolution model based on the K-band LF predicts that 
30 per cent of galaxies in the magnitude range 19 < H < 20 
have R — H > 4, exceeding the observed fraction of ~18 
per cent by two thirds. This fraction would decrease to 17 
per cent if we matched a bluer spectral type to spectral 
type 2 (see §|.2) instead of fitting the CWW E SED to 
both types 1 and 2. We plot the R — H > 4 counts for this 
model in Fig. ^ showing that they provide a good fit to 
the observed counts. On the other hand the semi-analytic 
model greatly underestimates the number of EROs, predict- 
ing that only 4 per cent of galaxies in this magnitude range 
have R-H > 4 (see also DCR; Martini 2001b). However the 
semi-analytic model does appear to produce enough bright 
galaxies at these redshifts (Fig. ^) indicating that the prob- 
lem lies in the modelled star formation history of massive 
galaxies rather than the assembly of mass through merg- 
ing. The deficit of EROs in the semi-analytic model could 
be due, as mentioned above, to some process inhibiting late 
star formation in massive galaxies that may not be modelled 
properly in the semi-analytic model, or it could be due to a 
significant fraction of the observed EROs being very dusty 
ULIRG-type galaxies and these are not modelled correctly 




Figure 27. The observed redshift distribution of EROs with 
R-H > 4 and with R-H > 4.5 in the LCIR survey for H < 20.0 
and H < 20.5 (upper left), the semi-analytic model (upper right), 
the no-evolution model (lower left) and two passive-evolution 
models (lower right). Error bars, shown on the R — H > 4, 
H < 20.5 sample only, are 90 per cent percentiles from boot- 
strap resampling of the full galaxy catalogue followed by Monte 
Carlo simulations in which random photometric errors were added 
to the observed photometry and photometric redshifts and colour 
cuts were recalculated. Here we have used the G97 i<"-band LF for 
the no-evolution and passive-evolution models. Note the different 
scale used in the lower relative to the upper panels. The median 
redshift of the observed EROs is z ~ 1 with the redder sample 
being at slightly higher redshift. The semi-analytic model greatly 
undcrpredicts the observed ERO counts while the no-evolution 
model is a closer fit to the observations. The passive-evolution 
models generally overpredict the number of EROs, especially the 
model (B) where the E/S0 population is modelled by a high- 
redshift SSP burst (note that the r = 1 Gyr model (E) does not 
actually produce any R — H > 4.5 galaxies though). By tuning 
the star formation history and formation redshift of these models 
it is possible to get the passive-evolution model to fit the observed 
ERO number counts. 



by the simple dust formalism used (equation g) (see also 
Guiderdoni et al. 1998; Blain et al. 1999b). 

The passive-evolution models - in particular those 
where present-day ellipticals are modelled by SSP high- 
redshift bursts - generally overpredict the number of EROs. 
However it is possible to tune the star formation history, 
formation redshift and fraction of the LF's normalization cj>, 
taken to correspond to the ERO population, in order to fit 
the observed numbers of EROs (e.g. DCR; McCarthy et al. 
2001). Dust obscuration may be invoked in these models to 
hide intense star-formation activity at high redshifts in order 
to fit the overall counts and redshift distributions (Mazzei, 
de Zotti & Xu 1994; Franceschini et al. 1994, 1998; Blain et 
al. 1999a) and this may be consistent with populations of 
high-redshift ULIRG-type galaxies being the progenitors of 
the z ~ 1 ERO population and ultimately present-day ellip- 
tical galaxies (de Zotti et al. 2001, and references therein). 

While we hoped to use the z ~ 1 density of evolved 
galaxies to discriminate between passive-evolution and hi- 
erarchical merging models, the test is compromised by the 
considerable freedom allowed within the passive-evolution 
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Table 3. Table showing the fraction of EROs, in the magnitude range 19 < H < 20, denned by various optical-near-IR colour cuts in 
the observations and in several models. For the passive-evolution model we fitted two alternative BC SEDs to the E/SO LF - (B) a single 
stellar population burst and (E) exponentially-decaying star formation with time-scale r = 1 Gyr, both with formation redshift Zf = 10. 
Here we have used the G97 iC-band LF for the no-evolution and passive-evolution models. The (E) model produces no R — H > 4.5 
or I — H > 3.5 galaxies, but could still be consistent with the observations if the reddest EROs are a separate population of dusty 
ULIRG-type galaxies (e.g. Smail et al. 1999 and references therein). 

sample R- H > 4.0 R - H > 4.5 / - H > 3.0 I - H > 3.5 



LCIR survey 18% 

semi-analytic model 4.3% 

no-evolution model 30% 

passive-evolution (B) 38% 

passive-evolution (E) 29% 



7% 11% 2.7% 

0.3% 2.5% 0.1% 

11% 9% 2.7% 

28% 26% 19% 

0% 21% 0% 




0.0 0.5 1.0 1.5 0.0 0.5 1.0 1.5 2.0 redshift number 

redshift 



Figure 28. The observed redshift distribution of EROs with 
/ - H > 3 and with / - H > 3.5 in the LCIR survey for H < 20.0 
and H < 20.5 (upper left), the semi-analytic model (upper right), 
the no-evolution model (lower left) and two passive-evolution 
models (lower right). Error bars, shown on the I — H > 3, 
H < 20.5 sample only, are 90 per cent percentiles from boot- 
strap resampling of the full galaxy catalogue followed by Monte 
Carlo simulations in which random photometric errors were added 
to the observed photometry and photometric redshifts and colour 
cuts were recalculated. Here we have used the G97 _R"-band LF for 
the no-evolution and passive-evolution models. Note the different 
scale used in the lower relative to the upper panels. The median 
redshift of the observed EROs is z ~ 1.2. The semi-analytic model 
greatly underpredicts the observed ERO counts, the passive- 
evolution models generally predict an excess of EROs (but may 
be fine-tuned to fit the observations) and the no-evolution model 
is a better fit to the observations. 



Figure 29. Absolute K magnitude Mk as a function of red- 
shift for galaxies in the LCIR survey (upper left) and divided by 
spectral type (upper right), and for galaxies in the semi-analytic 
model (lower left). In the LCIR survey, Mk has been estimated 
by extrapolating the observed UBVRIH photometry to the rest- 
frame A"-band using the best-fitting CWW SED for each galaxy. 
The number of galaxies in the semi-analytic model has been scaled 
to match the observations. EROs with R — H > 4 are marked by 
filled circles: there are many more EROs in the observations than 
in the model. At lower right are histograms of Mk summed over 
the redshift range 0.9 < z < 1.4. The faint cut-off is imposed by 
the H < 20 magnitude limit. Using Mk as a tracer of total stellar 
mass, the plot indicates that despite underpredicting the number 
of EROs the semi-analytic model does in fact produce enough 
bright galaxies (indicated by the bright-end Mk magnitudes) in 
the redshift range occupied by the observed EROs. 



frame-work, so that the passive-evolution models can be 
adjusted to fit a wide range of potential observations. A 
proper treatment must include constraints from a range of 
wavebands - something which is beyond the scope of this 
paper. Within the LCIR survey it is expected that more 
stringent constraints will be produced with the inclusion of 
further fields, some of which (a) have deeper near-IR imag- 
ing - allowing one to study galaxies in greater numbers at 
z ~ 1.5-2, where the number density of evolved galaxies 
more strongly discriminates between hierarchical merging 
and passive-evolution models and puts more stringent con- 
straints on the latter, and (b) have ZJK imaging - which, 
through increased spectral coverage, will allow better con- 



straints to be placed on the role of dust in these objects. 
On the other hand the observations have usefully been used 
to identify discrepancies in the semi-analytic model that are 
worthy of further investigation. 



8 ANGULAR CLUSTERING 

The angular correlation function w(6) gives the excess prob- 
ability over a random Poisson distribution that two sources 
will be found in the solid angle elements SQi and se P~ 
arated by an angle and is defined by 

5P = n 2 5n 1 Sn 2 [l + w(6)], (4) 
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where n is the mean number density of objects. A standard 
procedure for estimating w(8) from a catalogue of objects is 
to generate a random catalogue of a large number of points 
within the same area covered by the observations and then 
for each 8 count the number of distinct data-data (DD) 
pairs, data-random (DR) pairs and random-random (RR) 
pairs with separation 8 to 8 + d9. It is important that the 
random points are subject to the same selection effects as the 
data points. Due to the relatively small size of infrared de- 
tectors and various instrumental artifacts, the infrared data 
are relatively patchy in terms of depth so in this paper we 
have adopted conservative magnitude limits such that the 
remaining catalogue is essentially spatially uniform (see ^). 
The masked regions around bright stars are likewise removed 
from the random catalogue. With DD, DR and RR as de- 
fined above, w(8) is estimated by: 



H < 20.0 sample 



DD x RR 
DR 2 



(5) 



(Hamilton 1993). Following Hewett et al. (1982) we estimate 
errors analytically as \J (I + w(8))/DD. These are similar 
to lcr errors (Sharp 1979; but see also Mo, Jing & Borner 
1992). Proceeding in this manner we calculate w(8) for var- 
ious subsamples of the observations selected by H magni- 
tude, colour, photometric redshift and spectral type. At low 
redshifts, w(8) is well-approximated by w(8) = A(8 1 ~ 1 — C) 
(Groth & Peebles 1977; Maddox et al. 1990), where A is the 
clustering amplitude and C is the integral constraint 



C 



1 

9? 



8 L - 1 5Q, 1 5Q.2 



(6) 



Typically 7 = 1.6-1.8 depending to some extent on Hubble 
type. We fix 7 = 1.8 since this value has been frequently 
chosen in the literature when the quantity of data is too 
small to allow an accurate determination of both A and 7. 
We calculate C ~ 0.141 for the area comprising all six tiles 
and C ~ 0.151 for the area comprising the three deeper 
tiles (8 in arcminutes). We then obtained the amplitudes 
at 1 arcmin ( = 0.7 /i _1 Mpc in comoving coordinates at 
2 = 1 and 1.1 /i _1 Mpc at z = 2 for an Q = 0.3, tt A = 0.7 
cosmology) of w(8) by least-squares fitting over the angular 
range 4-600 arcsecs. 

Fig. ^ shows the correlation function of the stars in 
the H < 20.0 and H < 20.5 samples. It is a useful check 
on the quality and spatial uniformity of the catalogues that 
the correlation function of stars is consistent with zero clus- 
tering. Forcing a power-law fit returns an amplitude A of 
0.002 (H < 20.0) compared with 0.08 for the galaxies. The 
value 0.002 is in fact consistent with contamination by the 
estimated ~15 per cent fraction of galaxies that may have 
been conservatively classed as stars (see §||). In the remain- 
ing w(8) figures the integral constraint term has been added 
to the data so that the power-law fit is a straight line on a 
log plot. Fig. |l| compares w(8) for different H magnitude 
bins. The trend as one moves to fainter magnitudes is for the 
amplitude to decrease - a fainter sample covers a greater red- 
shift range and hence, in projection, there is greater dilution 
of the intrinsic clustering signal. These results are summa- 
rized in Table ^ in which we also note the increasing median 
redshift (from 0.42 to 0.69) with increasing H magnitude. 

In Fig. [32] we compare w(6) for different photo- 
metrically-determined spectral type classes - all galaxies, E, 



1 



H < 20.5 sample 



B (arcsecs) 

Figure 30. The angular correlation function w(9) of stars in the 
H < 20.0 catalogue (top panel) and in the H < 20.5 catalogue 
(lower panel). The observations are consistent with zero clustering 
implying that there is little loss of galaxies during star/galaxy 
separation and that the spatial uniformity of the data is good. 



A = 0.12 " 18.0 < H < 19.0 A = 0.10 




10 100 
8 (arcsecs) 



10 100 
9 (arcsecs) 



Figure 31. The angular correlation function w(8) of galaxies in 
H magnitude bins. The amplitude decreases as fainter magnitude 
limits encompass broader N(z) distributions. 



Sbc and Scd + Im - in the H < 20.0 catalogue. While the 
E and Sbc classes are equally strongly clustered, the Scd + 
Im class is much less strongly clustered. These results, and 
those for the H < 20.5 catalogue, are summarized in Table 
|E| This table also contains spatial clustering results from §^ 
and we leave further discussion until that section. In Fig. 
p3| we show angular clustering results for several red colour 
cuts. The R — H > 4 and I — H > 3 galaxies are four times 
more strongly clustered than the sample of all galaxies in 
the H < 20.0 catalogue but this drops to only two-and-a- 
half times in the H < 20.5 catalogue. This is probably due 
to both the broader redshift distribution and the fainter av- 
erage luminosity of the fainter sample. Indeed if we compare 
the clustering oi R— H > 4 or I — H > 3 galaxies with that 
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Table 4. The number of galaxies TV, median photometric redshift z m , median absolute B and J magnitudes Mb and Mj (h = 1) and 
amplitude A of w(9) at 1 arcmin for H magnitude bins. The approximately la errors on A are derived from \/(l ± w{8)) / DD errors on 
u>(9). Extra statistics are included in Table ftcl in the appendix. 



sample N z m M B Mi A 



H < 19 


1284 


0.42 


-19.2 


-21.1 


0.12 ±0.03 


18.0 < H < 19.0 


868 


0.47 


-19.1 


-21.0 


0.10 ±0.04 


18.5 < H < 19.5 


1292 


0.55 


-19.1 


-20.9 


0.07 ±0.03 


19.0 < H < 20.0 


1893 


0.61 


-19.1 


-20.8 


0.06 ±0.02 


19.5 < H < 20.5 


1472 


0.69 


-18.9 


-20.5 


0.07 ±0.02 



All goloxles A = 0.08 


E A = 0.13 


1 ' 1 1 _ 

Sbc A = 0.12 


Scd 
SI .V. 


St Im A = 0.01 

-4tt, ! 

v v si -W>>n. 1 1 Jr. i/ .v. * 



H < 20, iS - U > 4 jj . 0.35 ff < 20.5, R - H > A A - 0.17 



1 00 10 1 00 

(orcsecs) (orcsecs) 



Figure 32. The angular correlation function w(9) of the full 
galaxy sample in the H < 20.0 catalogue and of different 
photometrically-determined spectral type classes. The amplitude 
is greatest for red spectral types. 



of R — H < 4 or I — H < 3 galaxies in the same photometric 
redshift range (Table []) then the difference in clustering am- 
plitude is only a factor of two in both samples. We discuss 
these results further in §[| In Fig. |3^ we compare w(6) in 
different photometric redshift bins from z — to z — 1.5. 
The projected positions of thegalaxies in the different red- 
shift bins are plotted in Fig. pa. The angular clustering in 
0.5 redshift bins increases dramatically with redshift. From 
Table (?] we see that this correlates with an increase in me- 
dian absolute magnitude from Mi — 5 log ft = —19.9 in the 
lowest redshift bin to Mi — 51og/i = —22.5 at z m ~ 1.2. 
The increase in clustering amplitude with redshift is less 
pronounced in the fainter H < 20.5 sample. 

Though we have fitted a 1 — 7 = —0.8 power-law to 
w(0) (in view of the small sample sizes and for consistency 
with other studies) there is indication that a flatter slope is 
more appropriate for the fainter magnitudes (Fig. |3l] ) , later 
spectral types (Fig. ^2|) and higher redshift bins (Fig. |34|) in 
agreement with e.g. Loveday et al. (1995), Postman et al. 
(1998), Brown, Webster & Boyle (2000) and Kauffmann et 
al. (1999b). 

Given the observed clustering amplitude A for a sample 
we may calculate the expected fluctuations in the number 
counts in a region of given angular size. Following Peebles 
(1980) and Lahav & Saslaw (1992), the variance in the num- 
ber counts in square cells of size s = a x a arcmin 2 is 




100 

9 (orcsecs) 



(orcsecs) 



Figure 33. The angular correlation function of EROs with R — 
H > 4 and I — H > 3 for two H limiting magnitudes. Points have 
been omitted in the smallest angular separation bins where there 
are too few data-data pairs to obtain a decent measurement. For 
the H < 20.0 sample, the clustering amplitude is four times that 
of the galaxy population as a whole. This decreases to only about 
two-and-a-half times in the fainter sample, explained both by the 
more restricted redshift range of the brighter sample (Fig. ET] and 
Fig. and the greater intrinsic luminosity of these galaxies. The 
clustering amplitude of the R — H > 4, H < 20 sample is in good 
agreement (within 10 per cent) of that measured by Daddi et al. 
(2000a) for R - K s > 5, K s < 19 objects. 



((7v-iv) 2 ) = iv + (f f J J s w(e)d Sl ds 2 

= N + iV 2 C 7 a 1 - 7 A 



(7) 



where N is the mean number per cell, and C\ w 2.25 for 
7 = 1.8. Table [s] lists the observed a of the number counts 
over the six tiles, the expected a if the objects were dis- 
tributed according to a Poisson distribution and the calcu- 
lated a using equation (frt) for various samples. For stars the 
observed, Poisson and calculated a are comparable (less so 
in the fainter magnitude bins where there is some galaxy 
contamination). For galaxies, the observed a is comparable 
to the calculated a but much larger than the Poisson value 
- as expected, since galaxies are clustered. In equation (Q) 
we required the mean number density n of a sample of ob- 
jects in order to calculate w(8). It is important therefore 
to check that the survey is large enough to provide a good 
estimate of n. To do this we consider the calculated a and 
scale by l/VM where M is the number of tiles. For most 
samples the fluctuations in number counts between areas 
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Table 5. The number of galaxies N, median photometric redshift z m , median absolute B and I magnitudes Mb and Mj (h = 1), 
amplitude A of w(9) at 1 arcmin and spatial correlation lengths r in /i _1 Mpc (comoving coordinates) for various spectral types. We 
fix 7 = 1.8. The correlation length is calculated for two cosmologies f2o = 1, ^A = and Co = 0.3, Q A = 0.7. N(z) is derived from 
photometric redshifts. The errors on ro (e = —1-2) are approximately la and are carried through from the ^/(l + w(9))/DD errors on 
w(8). The errors on the other correlation lengths are of similar proportions. r*(z = 0.5) is listed for e = 0. For e = —1.2, r*(z) equals ro 
for any z. With median redshifts z m ~ 0.5, ~ 12 = r%~~ l ' 2 (z = 0.5) s=s rj~°(z = 0.5). Extra statistics are included in Table liol in the 
appendix. 

sample N z m M B Mi A r^~ 12 r*=° rj=°(0.5) r^p -1 ' 2 



Q = 0.3, Q A = 0.7 n = i, n A = 



H < 20.0 




















All galaxies 


3177 


0.52 


-19.1 


-20.9 


0.08±0.01 


5.5±0.4 


6.9 


5.3 


4.3±0.3 


E 


1194 


0.50 


-18.7 


-20.9 


0.13±0.03 


6.4±0.8 


8.1 


6.2 


5.1±0.7 


Sbc 


963 


0.61 


-19.6 


-21.3 


0.12±0.04 


7.7±1.3 


10.2 


7.8 


5.8±1.0 


Scd + Im 


1020 


0.48 


-19.1 


-20.5 


0.01±0.04 


1.9±1.9 


2.2 


1.7 


1.5±1.5 


H < 20.5 




















All galaxies 


2616 


0.57 


-19.0 


-20.8 


0.08±0.01 


6.1±0.4 


7.7 


5.9 


4.7±0.3 


E 


910 


0.53 


-18.5 


-20.7 


0.12±0.03 


6.6±0.9 


8.3 


6.1 


5.2±0.7 


Sbc 


735 


0.65 


-19.5 


-21.3 


0.12±0.04 


8.2±1.4 


11.0 


8.4 


6.1±1.1 


Scd + Im 


971 


0.52 


-19.0 


-20.4 


0.05±0.03 


4.2±1.4 


5.1 


3.9 


3.4±1.1 



Table 6. The number of galaxies N, median photometric redshift z m , median absolute B and / magnitudes Mb and Mj (h = 1), 
amplitude A of w(9) at 1 arcmin and spatial correlation lengths r in /i _1 Mpc (comoving coordinates) for various colour and photometric 
redshift-selected groups of red galaxies. We fix 7 = 1.8. The correlation length is calculated for two cosmologies f!o = F ^A = and 
f2o = 0.3, Hyv = 0.7. N(z) is derived from photometric redshifts. The errors on ro (e = —1-2) are approximately la and are carried 
through from the \J {1 + w(ff))/DD errors on w{9). The errors on the other correlation lengths are of similar proportions. r*(z = 1.0) is 
listed for e = 0. For e = —1.2, r*(z) equals ro for any z. Extra statistics are included in Table [To| in the appendix. 

sample N z m M B M 1 A r o" _1 ' 2 r cT° r ; =0 (!-0) r o" _1 ' 2 

n = 0.3, n A = 0.7 n = 1, n A = o 



H < 20.0 



All 


galaxies 




3177 


0.52 


-19.1 


-20.9 


0.08±0.01 


5.5±0.4 


6.9 


4.1 


4.3±0.3 


R - 


-H > 4 


0.7 < z < 1.5 


337 


1.01 


-20.2 


-22.3 


0.33±0.11 


11.1±2.0 


17.6 


11.1 


7.6±1.4 


R - 


-H < 4 


0.7 < z < 1.5 


480 


0.87 


-20.2 


-21.8 


0.15±0.08 


5.9±1.7 


8.8 


5.5 


4.1±1.2 


I- 


H > 3 


0.9 < z < 1.5 


201 


1.13 


-20.4 


-22.5 


0.36±0.18 


10.5±2.9 


17.3 


10.9 


7.0±1.9 


I - 


H < 3 


0.9 < z < 1.5 


268 


1.05 


-20.6 


-22.2 


0.17±0.13 


6.0±2.8 


9.6 


6.0 


4.0±1.9 


H < 20.5 
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galaxies 
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-19.0 


-20.8 


0.08±0.01 


6.1±0.4 


7.7 


4.8 


4.7±0.3 


R - 


-H > 4 


0.7 < z < 1.5 


312 


1.02 


-19.8 


-21.9 


0.17±0.09 


7.7±2.4 


12.1 


7.6 


5.2±1.6 


R- 


-H < 4 


0.7 < z < 1.5 


516 


0.89 


-20.0 


-21.6 


0.09±0.05 


4.7±1.7 


7.0 


4.4 


3.3±1.2 


I - 


H > 3 


0.9 < z < 1.5 


170 


1.16 


-20.3 


-22.4 


0.20±0.16 


7.5±3.7 


12.5 


7.9 


5.0±2.5 


I - 


H < 3 


0.9 < z < 1.5 
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1.08 


-20.4 


-22.0 


0.09±0.09 


4.4±3.3 


7.1 


4.5 


3.0±2.2 



the size of six combined tiles are less than 10 per cent. For 
the ERO samples the fluctuations are of order 15 per cent 
(20 per cent in the deeper area comprising only three tiles) 
so ideally we would like a larger area, such as the complete 
survey will provide, to get more robust measurements for 
these galaxies. 



9 SPATIAL CLUSTERING 

In order to interpret measurements of the angular corre- 
lation amplitude one must first deconvolve the projection 
effect: in a sample of galaxies with a wide redshift distri- 
bution N(z), the greater distance over which the sample 
is projected will result in greater dilution of the intrinsic 
clustering as compared with a sample of galaxies with a 



relatively narrow N(z). The statistic of interest is spatial 
clustering. 

Following Magiiocchetti & Maddox (1999), the angular 
correlation function w(ff) is related to the two-point spatial 
correlation function £(r) via the relativistic Limber's equa- 
tion (Peebles 1980): 



w{6) = 2 



Jo°° /o°° F- 2 {x)x i ^> 2 {x)i{r,z)dxdu 
[J^° F- 1 (x)x 2 ^(x)dx] 2 ' 



(8) 



where x is the comoving coordinate, F(x) gives the correc- 
tion for curvature, and the selection function <&(x) satisfies 



Af= / $(x)F- 1 (x)x 2 dx 



N(z)dz, 



(9) 



where N is the mean surface density on a surface of solid 
angle Q s and N(z)dz is the number of objects in the survey 
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Table 7. The number of galaxies N, median photometric redshift z m , median absolute B and I magnitudes Mb and Mj (h = 1), 
amplitude A of w(9) at 1 arcmin and spatial correlation lengths r in h~ 1 Mpc (comoving coordinates) for various photometric redshift- 
selected samples. We fix 7 = 1.8. The correlation length is calculated for two cosmologies Qo = 1, £7a = and Co = 0.3, Qa = 0.7. N(z) is 
derived from photometric redshifts. The errors on ro (e = —1-2) are approximately la and are carried through from the y/ (1 + w(0))/DD 
errors on w(8). The errors on the other correlation lengths are of similar proportions. r*(z m ) is listed for e = 0. For e = —1.2, r*(z) 
equals ro for any z. Since galaxies are localized into redshift bins, r»(z m ) is nearly independent of an assumed e. Extra statistics are 
included in Table llOl in the appendix. 

sample N z m M B Mj A r^" 12 rr°(z m ) r^" 1 ' 2 rr°{z m ) 



n = 0.3, n A = 0.7 n = 1, n A = 



H < 20.0 




















0.0 < z < 0.5 


1458 


0.32 


-18.1 


-19.9 
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3.3 
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0.30±0.10 


8. Oil. 5 


8.0 


5.4±1.0 


5.3 


H < 20.5 




















0.0 < z < 0.5 


1056 


0.32 


-17.8 


-19.5 
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1101 


0.66 


-19.3 
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-20.0 


-21.8 
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5.4±1.2 


5.3 
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3.7 
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Figure 34. The angular correlation function w(9) of H < 20.0 
galaxies selected by photometric redshift. The amplitude in- 
creases dramatically to higher redshifts for this apparent mag- 
nitude limited sample since at high redshifts only the most lumi- 
nous galaxies (presumably corresponding to peak fluctuations in 
the underlying matter density distribution) make it into the sam- 
ple. For a fainter H < 20.5 sample the increase to higher redshift 
is less dramatic. 



o.e < z < *.. 



.0 < z < 1.5 



x pixel coordinates 



Figure 35. The positions within the survey area of the H < 20.0 
galaxies in four photometric redshift bins. 



within the redshift shell (z,z + dz). To proceed we assume 
a power-law form for £(r c , z): 



r*(z) 



(10) 



where we assume 7 = 1.8, r c = (1 + z)r is the comoving 
separation between two sources whose physical separation 
is given (in the small angle approximation) by 

1/2 

r. -- I — + r'lr I , (11) 



+X ° 



and r*(z) is the comoving redshift-dependent correlation 



length, r* (z) is related to the zero-redshift correlation length 
ro via 



r»(z) = r (l + 2) 



[7- 



-3J/7 



(12) 



where e paramatrizes the evolution of clustering (e = cor- 
responds to constant clustering in proper coordinates and 
e = 7 — 3 = —1.2 corresponds to constant clustering in 
comoving coordinates). For galaxies in a relatively narrow 
redshift region with median redshift z m , r*(z m ) is the clus- 
tering length of the galaxies at z m and is nearly independent 
of the assumed value of e. 

Combining these equations gives 
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Table 8. Table showing for various samples the mean number 
counts per tile, the observed standard deviation in the number 
counts between the six tiles, the expected standard deviation if 
the objects in the sample were distributed according to a Pois- 
son distribution, and the expected standard deviation given the 
measured angular clustering amplitude (as described in the text). 
When calculating the observed standard deviation, the observed 
number counts in each tile were first normalized to the mean 
tile area. For stars, O" bservod ~ Toisscm ~ <?We)' F° r galaxies, 

^observed ^ &w{8) ^ CT Poisson • 



sample 


N 


^observed 


^Poisson 




All stars 


372 


28.9 


19.3 


21.5 


H < 19.0 


237 


22.6 


15.4 


16.0 


19.0 < H < 19.5 


64 


7.9 


8.0 


10.0 


19.5 < H < 20.0 


70 


15.4 


8.4 


11.2 


All galaxies 


533 


95.8 


23.1 


89.9 


E 


201 


38.9 


14.2 


43.1 


Sbc 


162 


38.5 


12.7 


34.6 


Scd + Im 


171 


23.4 


13.1 


17.3 


H < 19.0 


215 


42.3 


14.7 


44.3 


18.0 < H < 19.0 


145 


28.8 


12.0 


28.7 


18.5 < H < 19.5 


217 


38.6 


14.7 


36.3 


19.0 < H < 20.0 


318 


55.9 


17.8 


48.9 


0.0 < z < 0.5 


245 


32.6 


15.7 


45.7 


0.5 < z < 1.0 


223 


43.0 


14.9 


43.0 


0.8 < z < 1.3 


85 


24.1 


9.2 


23.1 


1.0 < z < 1.5 


57 


20.7 


7.6 


19.3 


R-H>4 


56 


18.5 


7.5 


19.7 


I-H > 3 


34 


11.7 


5.8 


12.9 



= H, J Q °° N(z) 2 P(n , z)(x(z)e)^r4zyF(z)dz 
%[f °° N(z)dz]» 

where H 1 = r[l/2]r[( 7 - l)/2]/r[ 7 /2] = 3.68 in the 
case where 7 = 1.8, Hq is the Hubble constant, P = 
(c/Ho)(dz/dx), and we derive N(z) from photometric red- 
shifts. For flat cosmologies (fio + fiA = 1), F(z) — 1, 



x ( z ) = -Lq- 1 / 2 [ ^ 

Ho J [(l + zr + n- 1 -^-- 

and 



P(fi , z) = fi /2 [(l + zf + fi ( 7 1 - 1] 



1/2 



(14) 



(15) 



The results for the various samples are listed in Tables 
H ^ and [?] for the cosmologies fio = 1, fiA = and fio = 0.3, 
fiA = 0.7. A more complete listing of results is given in Table 
|icj in the appendix. In the following discussion we use the 
fio — 0.3, fiA = 0.7 cosmology. 

From Table [s| the zero-redshift correlation length, ro, 
of the full galaxy sample, taking e = —1.2, is about 5.5- 
6 ft -1 Mpc, similar but slightly larger than typical local ro 
values of ~5-5.5 ft _1 Mpc as measured in opiica^y-selected 
surveys (e.g. 5.1 ft _1 Mpc Loveday et al. 1995; 5.2 ft _1 Mpc 
Postman et al. 1998; 5.4 h~ 1 Mpc Willmer, da Costa & Pel- 
legrini 1998; 4.9 ft _1 Mpc Carlberg et al. 2000; 5.9 ft _1 Mpc 
Cabanac, de Lapparent & Hickson 2000; 4.9 ft _1 Mpc Nor- 
berg et al. 2001a) 

The median redshifts of the E and Scd + Im samples 
are about 0.5 while the Sbc sample has a median redshift 
of about 0.6 and the median absolute rest-frame B mag- 



nitudes (as determined from photometric redshifts) are ap- 
proximately — 18.6 + 5 log ft (E), — 19.6 + 5 log ft (Sbc) and 
— 19.1 + 5 log ft (Scd + Im). For comparison, Folkes et al. 
(1999) provide the following values for L, galaxies in the 
2dF redshift survey: -19.6 (E/S0), -19.4 (Sb) and -19.0 
(Scd/Irr). Thus our spectral-type E sample reaches about 
one magnitude fainter than L, while the bluer samples reach 
about L t . The correlation length ro for e = —1.2, is about 
6.5 ft - Mpc for the E sample, 8 h~ 1 Mpc for the Sbc sample 
and 2-4 h~ Mpc for the Scd + Im sample. The low cor- 
relation length of blue or late-type galaxies relative to red 
or elliptical galaxies is well known at low redshifts (Love- 
day et al. 1995 find a factor of 1.3 difference; Carlberg et 
al. 1997 - factor of 2.7; Guzzo et al. 1997 - factor of 1.5; 
Willmer et al. 1998 - factor of 1.3; Brown et al. 2000 - 
factor of 2; Cabanac et al. 2000 - factor of 2.8) with some 
variation, depending in part on the particular selection cri- 
teria. For optically-selected surveys, this may be understood 
in the context that for the same optical magnitudes, red or 
early-type galaxies are intrinsically more massive than blue 
or late-type galaxies. For a near-IR-selected catalogue this 
is not so much of an issue within a given absolute magni- 
tude range, however when we compare all red galaxies with 
all blue galaxies in this paper, the red galaxies are on aver- 
age more luminous (in, for example, the rest-frame / band) 
and so are again expected to be intrinsically more massive 
than the blue galaxies. It is interesting that the Sbc sample 
is more strongly clustered than the E sample. This may be 
due to the different redshift and intrinsic luminosity distri- 
butions of the two samples - M/ !mo di an for the Sbc sample is 
~ 0.5 magnitudes brighter than Mr ime dian for the E sample - 
and/or the fact that we are probing to higher redshifts than 
local surveys so many of the brightest z > 0.75 spectral-type 
Sbc galaxies in this survey could be progenitors of present- 
day ellipticals. A clearer picture is expected to emerge with 
the full square degree of the LCIR survey, in which increased 
sample size will permit clustering measurements for different 
spectral types as a function of redshift and within restricted 
absolute magnitude ranges. 

The spatial correlation lengths for the ERO samples are 
listed in Table ^| The median redshifts of the R — H > 4 
and I — H > 3 samples are in the range 1 < z < 1.2. 
Taking e = —1.2, or equivalently r*(z ~ 1) for any e, the 
correlation length for the 7? — H > 4 and / — H > 3 samples 
with H < 20.0 is ~10.5 /i^Mpc. This decreases to ~7.5 
h~ Mpc for the H < 20.5 sample. These values are about 
1.5-2 times the local value of ro for all galaxies. We note 
that assuming an fio = 1 cosmology makes the comparison 
much less dramatic. Comparing with the correlation lengths 
of R — H < 4 and / — H < 3 galaxies in the same redshift 
range as the EROs (0.7 < z < 1.5 for -R — H > 4 and 
0.9 < z < 1.5 for I — H > 3) produces a difference in 
the red and blue object correlation lengths of order 1.7- 
1.8. This comparison is similar to the comparison of the 
correlation lengths of early and late or red and blue galaxies 
in local surveys and the factor of 1.8 difference is typical 
(Hermit et al. 1996; Brown et al. 2000; Cabanac et al. 2000). 
We surmise therefore that the strong clustering of EROs 
is not extraordinary. By comparison Willmer et al. (1998) 
measure ro ~ 8.0 /i~ 1 Mpc for local red galaxies with Mb — 
5 log ft < —19.5 and Guzzo et al. (1997) measure ro ~ 8.4 
h~ Mpc for local early-type galaxies with Mzm — 5 log ft < 
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— 19.5 while Norberg et al. (2001b) measure ro ~ 6.1 and 
7.6 /t _1 Mpc for samples of type 1 (M01) galaxies in the 
2dFGRS with median magnitudes Mb, — 51og/i = —20.3 
and —20.8 respectively. The strong clustering is evidence 
that the majority of the EROs are indeed massive galaxies 
in a restricted redshift range. For example, contamination 
at a level of ~20 per cent by uncorrelated galaxies over a 
wider redshift range or low-mass stars would increase the 
inferred ro of the remaining EROs by ~30 per cent. 

The photometric redshift bins 0.0 < z < 0.5, 0.5 < z < 
1.0, 0.8 < z < 1.3 and 1.0 < z < 1.5 have median redshifts 
z m of about 0.32, 0.65, 1.0 and 1.2 respectively (Table Q). 
In the H < 20.0 catalogue, r- t (z m ) (ro for e = —1.2) in- 
creases dramatically over these redshift bins, being respec- 
tively about 3, 4, 6.5 and 8 /i -1 Mpc (Fig. ^ ). Again we note 
that the increase is much weaker for an Qo — 1 cosmology. 
The probable explanation for the increase in clustering to 
high redshift is simply that the galaxies visible at high red- 
shift are on average intrinsically brighter and more massive 
than the galaxies at lower redshifts - the greatest peaks of 
matter fluctuations are expected to be most strongly clus- 
tered (Kaiser 1984; also Kauffmann et al. 1999b; Benson 
et al. 2001). Indeed the galaxies in the highest redshift bin 
(1.0 < z < 1.5) have a median absolute rest-frame / mag- 
nitude of —22.5 + 51og/i compared with —19.9 + 5 log h in 
the lowest redshift bin and about 1-1.5 magnitudes brighter 
than L*. A similar dependence of ro on absolute magni- 
tude has been measured for galaxies at low redshifts in the 
2dF Galaxy Redshift Survey (Norberg et al. 2001a). The 
H < 20.5 sample is on average fainter and the increase in r* 
to higher redshift is proportionally less pronounced. If these 
r* values are compared with the ro values quoted in Nor- 
berg et al. (2001a) for samples with similar median absolute 
magnitudes to factor out this effect, then the ratio r„/ro ~ 1 
in all redshift bins. 

Of more interest is the evolution of r*(z) for galax- 
ies in a fixed absolute magnitude range and in particular 
for galaxies selected in the near-IR where absolute magni- 
tude traces total stellar mass. Unfortunately the sample size 
available for this paper limits what we can do at this stage, 
however Fig. ^ shows r*(z) for two photometric redshift 
bins with median redshifts z m ~ 0.6 and 1.0 and —23.2 < 
Mi — 51og/i < —21.2. These results are also summarized in 
Table ^| Though there is still some increase in average abso- 
lute magnitude with redshift within the selected range, the 
spatial correlation length does not change significantly with 
redshift. Isolating only the spectral- type E and Sbc galax- 
ies (to reduce the effect of the variation with redshift of the 
spectral type composition of the sample) further reduces the 
correlation lengths in the higher redshift bin relative to the 
lower redshift bin. This nearly flat evolution of r» for lumi- 
nous red-selected galaxies to z ~ 1 is consistent with model 
predictions for massive red or early-type galaxies (e.g. Kauff- 
mann et al. 1999b) and the merging model shown in Fig. |37| 
In contrast, previous optically-selected surveys have noted a 
drop in r* over the same redshifts, ranging from a slight de- 
crease (Carlberg et al. 2000; Hogg et al. 2000; Teplitz et al. 
2001) to a decrease of up to ~50 per cent (Le Fevre et al. 
1996; Adelberger 2000; Cabanac et al. 2000). Depending on 
the selection band, optical surveys include a mixture of red 
and blue galaxies at low redshifts but become increasingly 
dominated by bluer spectral types at higher redshifts as red 



galaxies drop out of the optical filters. This selection effect 
could lead to a decrease in the observed clustering. Con- 
versely, near-IR selection avoids this effect. Likewise, models 
(Kauffmann et al. 1999b) predict that the typical halo mass 
of galaxies selected in a fixed rest-frame absolute magnitude 
range decreases more rapidly with increasing redshift for 
bluer selection bands, leading to a corresponding decrease 
in r, to z ~ 1.5 for these galaxies, which is less pronounced 
for redder rest-frame selection bands. 

Also in Fig. |^we plot r*(z) for galaxies selected to have 
absolute B magnitude in the range —21.2 < Mb — 5 log h < 
— 19.2 along with a measurement at z ~ 0.1 for galaxies 
in a similar absolute B magnitude range in the 2dF Galaxy 
Redshift Survey (Norberg et al. 2001a). Since we use 7 = 1.8 
while Norberg et al. use 7 = 1.68 for this particular sample, 
it is useful also to calculate erg - the r.m.s. fluctuations in the 
galaxy distribution at the scale of 8 /i _1 Mpc - which is in- 
dependent of 7. Following Peebles (1980) and Magliocchetti 
et al. (2000), we calculate ag from r„ and 7 using 



where 



r,(z) 



1/2 



72 



( 3 _ 7 )(4_ 7 )( 6 _ 7 )27- 



(16) 



(17) 



The resulting values are cr 8 = 0.95±0.07, 1.01±0.17 and 
0.97±0.24 at z m ~ 0.1, 0.6 and 1.0 respectively, showing 
little evolution with redshift. 

A potential source of error in our spatial clustering es- 
timates arises from the differences between the photometric 
redshift distribution of a sample of galaxies and the sample's 
actual redshift distribution. For example, a sample chosen to 
have photometric redshifts in the interval 0.5 < z < 1.0 will 
include some galaxies with actual redshifts outside this range 
which will dilute the measured clustering signal leading to an 
underestimate of r* . Other situations can result in biases in 
the opposite direction. Since particular SED/redshift combi- 
nations give more robust photometric redshifts than others 
the effect can vary depending on the sample. It is possible 
to address this problem to some extent with Monte Carlo 
simulations using simulated catalogues based on the data 
or on models with noise added to mimic the observations 
and, in Limber's equation, using N(z) based on the original 
redshifts for photometrically-selected subsamples. We found 
that the results presented here for spectral-type, magnitude 
and colour selected samples did not change by more than 
a few percent with respect to this effect while the values 
of r» in photometric redshift bins increased (as expected) 
by ~10-20 per cent. This does not affect our conclusions 
significantly. 



10 DISCUSSION 

In this paper we have presented an analysis of 4019 H < 20.5 
galaxies in 744 arcmin 2 of the LCIR optical/near-IR survey. 
Photometric redshifts place the majority of these galaxies 
in the redshift range < 2 < 1.5 with a median red- 
shift of ~ 0.55. Our analysis includes comparisons of colour 
distributions, number counts, N(z) distributions for differ- 
ent spectral types and ERO number counts with a semi- 
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Table 9. The number of galaxies N, median photometric redshift z m , median absolute B and / magnitudes Mb and Mj (h = 1), 
amplitude A of w(9) at 1 arcmin and spatial correlation lengths r in /i~ 1 Mpc (comoving coordinates) for absolute B and I magnitude 
limited samples in two photometric redshift bins. The Mb samples are selected to have —21.2 < Mb — 51og/i < —19.2 while the Mi 
samples are selected to have —23.2 < Mj — 51og/i < —21.2. We fix 7 = 1.8. The correlation length is calculated for two cosmologies 
Qo = 1| f^A = an d &o = 0.3, £7a = 0.7. N(z) is derived from photometric redshifts. The errors on ro (e = —1.2) are approximately la 
and are carried through from the + w(0))/DD errors on w(6). The errors on the other correlation lengths are of similar proportions. 
r t (zm) is listed for e = 0. For e = —1.2, r»(z) equals ro for any z. Since galaxies are localized into redshift bins, r,(z m ) is nearly 
independent of an assumed t. Extra statistics are included in Table |l0| in the appendix. 
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Figure 36. The comoving correlation length, r*(z m ) (7 = 1.8), 
where z m is the median photometric redshift of the sample, for 
photometric redshift bins in the H < 20.0 and H < 20.5 cata- 
logues, r* (z m ) is derived via Limber's equation from the measured 
amplitudes of w(9) using the photometric redshift N(z) distribu- 
tions and two cosmologies — f2p = 1, Q A = and f2o = 0.3, 
Qa = 0.7. The error bars are approximately la. The increase in 
r* (2) to higher redshift is probably a result of the increasing av- 
erage luminosity with redshift for an apparent magnitude limited 
survey (cf. Fig. [j?]). 



analytic hierarchical merger model and no-evolution and 
passive-evolution models. We also calculate angular and spa- 
tial clustering statistics for different magnitude, colour, spec- 
tral type and photometric redshift-selected subsamples out 
to z ~ 1.2. The main results are summarized below. 

• The LCIR number counts (Fig. ^3]) agree well with pre- 
vious surveys. The faint end (18 < H < 20) slope in the H 
number counts is 0.38 which compares with 0.31 for H > 20 
(Yan et al. 1998) and 0.47 for H < 19 (Martini 2001a). The 
number density of EROs with R — H > 4 and H < 20 is 0.5 
arcmin -2 which agrees well with 0.5 arcmin -2 for R—K s > 5 
and K s < 19 in Daddi et al. (2000a). 

• Both the total H number counts and ERO number 
counts are well-fitted by the no-evolution model, however 
the no-evolution model underestimates the number of galax- 
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Figure 37. The comoving correlation length, r*(z m ), where z m 
is the median redshift of the sample, for samples selected by abso- 
lute magnitude in two photometric redshift bins. The correlation 
length r-*(z m ) is derived via Limber's equation from the measured 
amplitudes of w(6) assuming 7 = 1.8 and using the photometric 
redshift N(z) distributions and an Co = 0.3, Ha = 0.7 cosmology. 
The error bars are approximately la. Results are expressed in co- 
moving coordinates with h = 1. Also plotted is a measurement 
from the 2dF Galaxy Redshift Survey for galaxies at z ~ 0.1 in a 
similar absolute B magnitude range (using 7 = 1.68) (Norberg et 
al. 2001a). The lines correspond to various models of clustering 
evolution (assuming Qq = 0.3, = 0.7): a merging model nor- 
malized to ro of early-type galaxies in the APM (Loveday et al. 
1995) (long dashes) and normalized to all galaxies in the APM 
(short dashes), and a galaxy conserving (Fry 1996) model sim- 
ilarly normalized (dot-dashed line and dotted line respectively). 
The reader is referred to Moustakas & Somerville (2001) for de- 
tails of these models. The solid line models the evolution of clus- 
tering in the underlying dark matter. The observations are most 
consistent with the merging model, though the error bars still 
allow considerable variation. 
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ies in the highest redshift bins, indicating some luminosity 
evolution or merging in the observations. 

• The passive-evolution models fail to fit the observed 
N(z), predicting too many bright galaxies at high red- 
shifts. Similarly, passive-evolution models generally predict 
too many EROs, especially if the majority of massive ellip- 
tical galaxies are hypothesised to form at high redshifts and 
evolve passively thereafter, though it is certainly possible 
that this scenario is true for some fraction of the elliptical 
galaxy population. Fine-tuning of star formation histories, 
formation redshifts and the inclusion of dust-obscured star- 
formation may be used to improve the match between these 
models and the observations (e.g. Daddi et al. 2000b) but 
need to be tested for consistency with observations at all 
wavelengths. 

• The semi-analytic model fits the observed total N(z) 
but predicts too few spectral-type E galaxies and too few 
EROs relative to bluer spectral types. For example in the 
magnitude range 19 < H < 20, 18 per cent of observed 
galaxies have colours R — H > 4 while in the semi-analytic 
model the fraction is only 4.3 per cent. However the semi- 
analytic model does appear to produce enough bright galax- 
ies at z ~ 1. Hence the deficit of EROs could be due, for 
example, to some process inhibiting late star formation in 
massive galaxies at z ~ 1 or to some of the EROs being very 
dusty ULIRG-like galaxies, either of which may not be mod- 
elled properly. Existing spectroscopic, sub-mm and morpho- 
logical observations suggest that both dusty-starbursts and 
evolved ellipticals are present in the ERO population with 
the larger fraction being evolved ellipticals. While strong 
clustering of EROs indicates that they are mostly massive 
galaxies in a restricted redshift range, it does not necessarily 
distinguish between evolved elliptical galaxies at z ~ 1 and 
massive ULIRG-type galaxies at z ~ 1 or greater, both of 
which may be progenitors of present day massive elliptical 
galaxies (Blain et al. 1999b; de Zotti et al. 2001; Scott et al. 
2001). Other evidence such as the tightness of cluster ellip- 
tical colour-magnitude relations (Ellis et al. 1997) and the 
slow evolution of the fundamental plane zeropoint in cluster 
and field ellipticals (van Dokkum et al. 2001) also indicate 
that late star formation is inhibited in some massive ellipti- 
cals though other z < 1 elliptical galaxies (particularly the 
less massive ones) show features of recent star formation 
(e.g. Franceschini et al. 1998; Abraham et al. 1999; Menan- 
teau, Abraham & Ellis 2001; Im et al. 2001) possibly induced 
by merger events. 

• For the full galaxy sample we find ro ~ 5.5-6 /i -1 Mpc 
(e = —1.2), which is slightly larger than the value of ro ~ 
5-5.5 h~ Mpc typically measured in low-redshift optically- 
selected surveys (Loveday et al. 1995; Postman et al. 1998; 
Carlberg et al. 2000; Norberg et al. 2001a). The enhanced 
clustering in the LCIR survey is expected as a result of the 
increased fraction of red galaxies in a near-IR-selected sam- 
ple as opposed to an optically-selected sample. 

• We find that redder (spectral-type E + Sbc) galaxies 
are more strongly clustered than bluer (spectral-type Scd 
+ Im) galaxies with a factor of ~ 2 difference in ro. This 
is consistent with the results of low-redshift galaxy surveys 
(Loveday et al. 1995; Hermit et al. 1996; Cabanac et al. 2000; 
Brown et al. 2000). 

• We find that the angular clustering amplitude of EROs 
defined by colours R — H > 4 or I — _ff>3isa factor 



of 2.5-4 times that of the whole galaxy population at the 
same limiting H magnitude (Fig. R3j). This is partly due to 
the restricted redshift range (z ~ 0.8-1.4) of these galaxies 
(Fig. ^ & pj| . The corresponding spatial correlation lengths 
are r,(z = 1) ~ 7.5-10.5 h^Mpc (cr 8 = 1.3-1.7). At z ~ 1, 
only galaxies at the bright end of the luminosity function 
make it into the sample and for such galaxies a fairly large 
correlation length is to be expected (Norberg et al. 2001a). 
The correlation lengths of the EROs are a factor of ~1.8 
times those of R — H < 4 and I — H < 3 galaxies selected in 
the same redshift range - a difference comparable with that 
found between red and blue galaxies at low redshift. 

• The correlation length r*(z) in photometric redshift 
bins increases dramatically from z — to z ~ 1.2 (Fig. 
|ji| ) which we interpret to be a result of the systematic in- 
crease in average luminosity of the sample to higher red- 
shifts as a result of selecting by apparent magnitude. When 
r»(z) is calculated for galaxies selected by absolute rather 
than apparent magnitude there is no significant increase in 
r,(z) with redshift over this range (Fig. ^). This is consis- 
tent with semi-analytic model predictions for red galaxies 
(Kauffmann et al. 1999b). 

In the present paper, in calculating correlation statis- 
tics for various subsamples of the data, we are still restricted 
by some of the problems that our photometric redshift ap- 
proach was designed to overcome - specifically because of 
low sample sizes we are unable to calculate correlation statis- 
tics as a function of spectral type and redshift simultane- 
ously and the results we do show have large error bars. Sim- 
ilarly we are generally unable to impose absolute magnitude 
limits. With a larger sample size (e.g. the square degree of 
the completed LCIR survey) it will be possible to compare 
the clustering of galaxies of the same spectral type and in- 
trinsic luminosity at different redshifts. In particular for a 
near-IR-selected sample this may be closely tied to the to- 
tal stellar mass of galaxies rather than being sensitive to 
recent star-formation activity as are optically-selected sam- 
ples. The larger sample will also allow us to push out to z ~ 
1.5-2 with better statistics. Addition of extra filters, giv- 
ing more complete spectral coverage, will allow us to place 
stronger constraints on the effects of dust on our results (Fig. 
pg| ; see also Moustakas 1999; Pozzetti & Mannucci 2000). 

As follow-up to this survey we are engaged in a pro- 
gramme of follow-up spectroscopy with the Keck and Mag- 
ellan telescopes in order to better characterize the ERO pop- 
ulation and to confirm photometric redshift accuracy in the 
range 0.5 < z < 1.5. 



11 SUMMARY 

• Simple passive-evolution models generally over- 
estimate the observed number of high-redshift galaxies, in- 
cluding EROs. 

• The hierarchical merger model fits the overall N(z) but 
under-estimates the number-density of EROs by a factor of 
four (Fig. ||&0). 

• For all ~4000 galaxies at H < 20-20.5 we find comoving 
r ~5.5 h~ 1 Mpc. 
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Figure 38. Comparison of a typical elliptical galaxy spectrum 
at z = 1 with an irregular galaxy spectrum at z = 1.5 with two 
magnitudes of extinction in the V band (CABKKS). Both have 
very red R — K colours — the colour most frequently used in the 
literature to define EROs. On the otherhand, an R — H or I — H 
red colour cut is more sensitive to a prominent 4000 A break 
than the smoother cut-off in dusty galaxies, while the intervening 
Z and J filters may be used to distinguish evolved galaxies from 
dusty non-evolved galaxies. We note that other reddening laws, 
such as Seaton (1979; Milky Way), Prevot et al. (1984; SMC) and 
Bouchct et al. (1985; SMC), produce redder R — K colours (~6.6) 
in the right-hand plot for the same Ay. 



• We find that red galaxies have ro twice that of blue 
galaxies. 

• For EROs, r ~7.5-10.5 h~ 1 Mpc which is twice that of 
non-EROs selected in the same redshift range (Table |^). 

• For an apparent H magnitude-selected sample, ro in- 
creases dramatically with redshift (Fig. |36| ) but for an ab- 
solute rest-frame / magnitude-selected sample there is no 
significant increase in ro toz~l (Fig. |t]). 
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Figure 39. H — K, J — K, J — H and R — I colours for various SEDs as a function of redshift, for the purpose of conversion between 
the various ERO colour selection criteria used in the literature. The SEDs include the empirical CWW templates E, Sbc, Scd, Im and 
evolving BC GISSEL'98 SEDs with single stellar population (SSP) burst, exponentially decaying star formation with time-scale r = 1, 
3, 5 and 15 Gyr and constant star formation rate (SFR), all with formation redshift Zf = 10 (see also §[].2). An Slo = 0.3, S1a = 0.7, Ho 
= 70 km s — 1 Mpc - 1 cosmology has been assumed. The BC SEDs have been reddened by Ay = 0.0, 1.0 and 5.0 (CABKKS). In each 
panel the redder SEDs (E, SSP burst) are towards the top while the bluer SEDs (Im, constant SFR) are towards the bottom. At z = 4 
the galaxies are ~1.1 Gyr old in this cosmology. 
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Table 10. The number of galaxies N, median photometric redshift z m , median absolute B and / magnitudes Mg and Mj (h = 1), amplitude A of w(9) at 1 arcmin and spatial 
correlation lengths r in h~ 1 Mpc (comoving coordinates) for various subsamplcs of the LCIR survey selected by magnitude, colour, spectral type and photometric redshift. The correlation 
length is calculated for two cosmologies Qo = 1, = and fin. = 0-3, fiA = 0.7, N(z) is derived from photometric redshifts and 7 is fixed to 1.8. The errors on rn (e = —1-2) are 
approximately 1<t and are carried through from the + w(0))/ DD errors on w(0). The errors on the other correlation lengths are of similar proportions, ro = r,(z = 0), r„(z m ), 

r„ (z = 0.5) and r*(z = 1) are listed for e = 0. For e = —1.2, r»(z) equals ro for any z. The subsamples marked Mb are selected to have absolute B magnitudes in the range 
—21.2 < Mb — 51ogh < —19.2 while those marked Mj are selected to have absolute / magnitudes in the range —23.2 < Mj — 51og/i < —21.2. 

sample catalogue N z m U B Mj A r'= ~ 1 ■ 2 r* =0 r%=° (* m ) '^ = °(0.5) »-J =0 (1.0) r o =_1 ' 2 r = ° r' = ° {z m ) r« = °(0.5) rj =0 (1.0) 
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